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Porcine Retrovirus 

The present invention relates inter alia to porcine 
retrovirus (PoEV) fragments, in particular polynucleotide 
fragments encoding at least one porcine retrovirus expression 
product, a recombinant vector comprising at least one 
polynucleotide fragment, use of PoEV polynucleotide fragments in 
the detection of native porcine retrovirus, a host cell 
containing at least one PoEV polynucleotide fragment or a 
recombinant vector comprising at least one PoEV polynucleotide 
fragment, PoEV polypeptides, antibodies immuno-reactive with PoEV 
polypeptides, pharmaceutical compositions comprising recombinant 
PoEV polypeptides for use as prophylactic and/or therapeutic 
agents and uses of PoEV polynucleotide fragments and/or 
polypeptides in medicine, including veterinary medicine and in 
the preparation of medicaments for use in medicine, including 
veterinary medicine . 

Porcine retrovirus (PoEV) is an endogenous (genetically 
acquired) retrovirus isolated from pigs and expressed in cell 
lines derived from porcine material. There are no known 
pathogenic effects associated with the virus per se in its 
natural host although the virus appears to be associated with 
lymphomas in pigs and related viruses are associated with 
leukaemias and lymphomas in other species- The virus has been 
reported to infect cells from a variety of non-porcine origins 
and is, therefore, designated as a xenotropic, amphotropic or 
polytrophic virus (Lieber MM, Sherr CJ. Benveniste RE and Todaro 
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GJ. 1975; Strandstrom H, Verjalainen P, Moaning V, Hunsmann G, 
Schwarz H, and Schafer 1974; Todaro GJ, Benveniste RE, Lieber 
MM and Sherr CJ. 1974) • The observation that the above viruses 
may have the potential to infect humans and have a pathogenic 
effect suggests that the issue of porcine retroviruses roust be 
addressed in the context of xenotransplanting pig tissues or 
cells. Therefore, information on the properties of PoEV and the 
development of diagnostic reagents, molecular engineering tools 
and potential vaccine materials is of paramount importance for 
example in xenotransplantation technology and the like. 

It is an object of the present invention to obviate and/or 
mitigate against at least some of the above disadvantages. 

In one aspect the present invention provides an isolated 
PoEV polynucleotide fragment: 

(a) encoding at least one porcine retrovirus (PoEV) 
expression product; 

(b) encoding a physiologically active and/or immunogenic 
derivative of said expression product; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) • 

Preferably, the polynucleotide fragment encodes the gag gene 
(gag) , polymerase gene (pol) and/or envelope (env) gene of PoEV. 
Thus, said expression product can be the virion core polypeptides 
(GAG) and polymerase (POL) and/or envelope (ENV) polypeptides of 
PoEV. Thus, the invention further provides a recombinant PoEV 
virion core, polymerase and/or envelope polypeptide. 

"Polynucleotide fragment" as used herein refers to a chain 
of nucleotides such as deoxyribose nucleic acid (DNA) and 



BNSOOCID: <WO_9740167A1J«> 



wo 97/40167 PCT/GB97/01087 

3 

transcription products thereof, such as RNA. Naturally, the 
skilled addressee will appreciate the whole naturally occurring 
PoEV genome is not included in the definition of polynucleotide 
fragment. 

The polynucleotide fragment can be isolated in the sense 
that it is substantially free of biological material with which 
the whole genome is normally associated in vivo. The isolated 
polynucleotide fragment may be cloned to provide a recombinant 
molecule comprising the polynucleotide fragment • Thus, 
•'polynucleotide fragment" includes double and single stranded 
DNA, RNA and polynucleotide sequences derived therefrom, for 
example, subsequences of said fragment and which are of any 
desirable length. Where a nucleic acid is single stranded then 
both a given strand and a sequence complementary thereto is 
within the scope of the present invention. 

In general, the term "expression product" refers to both 
transcription and translation products of said polynucleotide 
fragments. When the expression product is a "polypeptide" (i.e. 
a chain or sequence of amino acids displaying a biological and/or 
immunological activity substantially similar to the biological 
and/or immunological activity of PoEV virion core, polymerase 
and/or envelope protein) , it does not refer to a specific length 
of the product as such. Thus, the skilled addressee will 
appreciate that "polypeptide" encompasses inter alia peptides, 
polypeptides and proteins of PoEV. The polypeptide if required, 
can be modified in vivo and in vitro, for example by 
glycosylation, amidation, carboxylation, phosphorylation and/ or 
post-translational cleavage . 
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Polynucleotide fragments comprising portions encompassing 
the PoEV genome, and derived from retrovirus particles released 
from a reverse transcriptase-positive porcine kidney cell line 
PK-15, have been molecular ly cloned into a plasmid vector. This 
was achieved by synthesising cDNAs of PoEV RNA genomes which were 
recovered from porcine kidney cells expressing the endogenous 
virus. The cDNA was cloned into a plasmid vector and the 
isolated PoEV DNA fragment determined (see Figures 1,2 and 3). 
The sequence of the sequence identified in Figure 1 was the 
earliest determined sequence, followed by the sequence in Figure 
2 and lastly by the most recently revised sequence shown in 
Figure 3 . An additional study has been carried out to determine 
whether or not the human cell line "Raji" was susceptible to 
infection with the PoEV present in porcine kidney cells (PK15) . 
A raji clone has now been obtained and the DNA sequence of its 
env gene region has been determined (see Figure 4) . 

The DNA fragment of Figure 3 was shown to encode three open 
reading frames (ORFs) of 524, 1194 and 656 amino acids 
respect i ve ly . 

A comparison of the amino acid sequence against previously 
sequenced retroviruses from other species indicated that novel 
retrovirus cDNA had been cloned. Sequence analysis using the 
Lasergene software from DNASTAR Inc. showed that homologies were 
observed between the cloned PoEV DNA and the majority of 
retroviruses and that the closest homologies were to gibbon 
leukaemia virus (GaLV) in the polymerase ipol) and (env) regions 
of the pro-virus. 

The first open reading frame ORF of Figure 3 (nucleotides 588- 
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2162) is predicted to encode the PoEV virion core polypeptide 
(gag gene) . The second ORF (nucleotides 2163-5747) is predicted 
to encode the PoEV polymerase polypeptide (pol gene) . The third 
ORF (nucleotides 5620-7590) is predicted to encode the PoEV 
envelope polypeptide (env gene) . The skilled addressee will 
appreciate that it is possible to genetically manipulate the 
polynucleotide fragment or derivatives thereof, for example to 
clone the gene by recombinant DNA techniques generally known in 
the art and to express the polypeptides encoded thereby in vitro 
and/or in vivo , DNA fragments having the polynucleotide sequence 
depicted in Figures 1,2,3 and/or 4 or DNA/RNA derivatives 
thereof, may be used as a diagnostic tool or as a reagent for 
detecting PoEV nucleic acid in nucleic acids from donor animals 
or as a vaccine* 

Preferred fragments of this aspect of the invention are 
polynucleotide fragments encoding: (a) at least one of the one 
to three polypeptides having an amino acid sequence which is 
shown in Figures 1,2,3 and/or 4 (b) encoding a polypeptide which 
is a physiologically active and/or immunogenic derivative of at 
least one of the polypeptides defined in (a) ; or (c) which is 
complementary to a polynucleotide sequence as defined above; or 
polynucleotide fragments: (a) comprising at least one of the ORFs 
shown in Figures 1,2,3 and/or 4 or comprising a corresponding RNA 
sequence; (b) comprising a sequence having substantial nucleotide 
sequence identity with a sequence as described in (a) above; or 
(c) comprising a sequence which is complementary to a sequence 
as described in (a) or (b) above. It is to be understood that 
the term "substantial sequence identity" is taken to mean at 
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least 50% (preferably at least 75%, at least 90%, or at least 
95%) sequence identity. 

The polynucleotide fragment of the present invention may be 
used to examine the expression and/or presence of the PoEV virus 
in donor animals and cells, tissues or organs derived from the 
donor animals to see if they are suitable for xenotransplantation 
(i.e. PoEV free). In addition, the recipients of pig cells, 
tissues or organs can be examined for the presence and/ or 
expression of PoEV virus directly or by co-culture or infection 
of susceptible detector cells. 

A polynucleotide fragment of the present invention may be 
used to identify polynucleotide sequences within the PoEV genome 
which are PoEV specific (i,e. it is not necessary for the 
complete PoEV genome to be identified) . Such PoEV specific 
polynucleotide sequences may be used to identify PoEV nucleic 
acid in samples, such as transplanted cells, tissues or organs 
and may be included in a definitive test for PoEV. 

Thus, the present invention further provides an isolated 
PoEV polynucleotide fragment capable of specifically hybridising 
to a PoEV polynucleotide sequence. In this manner, the present 
invention provides probes and/or primers for use in ex vivo 
and/or in situ PoEV virus detection and expression studies. 
Typical detection studies include polymerase chain reaction (PGR) 
studies, hybridisation studies, or sequencing studies. In 
principle any PoEV specific polynucleotide sequence from the 
above identified PoEV sequence may be used in detection and/or 
expression studies. 

"Capable of specifically hybridising" is taken to mean that 
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said polynucleotide fragment preferably hybridises to a PoEV 
polynucleotide sequence in preference to polynucleotide sequences 
of other virus, animal (especially porcine or human sequences) 
and/or other species. In a preferment the PoEV fragment 
specifically binds to a native PoEV polynucleotide sequence or 
a part thereof. 

The invention includes polynucleotide sequence (s) which are 
capable of specifically hybridising to a PoEV polynucleotide 
sequence or to a part thereof without necessarily being 
completely complementary to said PoEV polynucleotide sequence or 
fragment thereof. For example, there may be at least 50% 
preferably at least 75%, most preferably at least 90% or at least 
95% complementarity. Of course, in some cases the sequences may 
be exactly complementary (100% complementary) or nearly so (e.g. 
there may be less than 10, preferably less than 5 mismatches) . 
Thus, the present invention also provides anti-sense or 
complementary nucleotide sequence (s) which is/are capable of 
specifically hybridising to the disclosed DNA sequence. If a 
PoEV specific polynucleotide is to be used as a primer in PGR 
and/or sequencing studies, the polynucleotide must be capable of 
hybridising to PoEV nucleic acid and capable of initiating chain 
extension from 3' end of the polynucleotide, but not able to 
correctly initiate chain extension from non PoEV sequences 
(especially from human, or non-PoEV porcine sequences) . 

If a PoEV specific test polynucleotide sequence is to be 
used in hybridisation studies, to test for the presence of PoEV 
nucleic acid in a sample, the test polynucleotide should 
preferably remain hybridised to a sample polynucleotide under 
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Stringent conditions* If desired, either the test or sample 
polynucleotide may be immobilised. Generally the test 

polynucleotide sequence is at least 10 or at least 50 bases in 
length* It may be labelled by suitable techniques known in the 
art. Preferably the test polynucleotide sequence is at least 200 
bases in length and may even be several kilobases in length. 
Thus, either a denatured sample or test sequence can be first 
bound to a support. Hybridization can be effected at a 
temperature of between 50 and 70^0 in double strength SSC (2xNaCl 
17.5g/l and sodium citrate (SC) at 8.8g/l) buffered saline 
containing 0.1% sodium dodecyl sulphate (SDS) . This can be 
followed by rinsing of the support at the same temperature but 
with a buffer having a reduced SSC concentration. Depending upon 
the degree of stringency required, and thus the degree of 
similarity of the sequences, such reduced concentration buffers 
are typically single strength SSC containing 0,1%SDS, half 
strength SSC containing 0.1%SDS and one tenth strength SSC 
containing 0.1%SDS. Sequences having the greatest degree of 
similarity are those the hybridisation of which is least affected 
by washing in buffers of reduced concentration. It is most 
preferred that the sample and inventive sequences are so similar 
that the hybridisation between them is substantially unaffected 
by washing or incubation in one tenth strength sodium citrate 
buffer containing 0.1%SDS. 

PoEV specific oligonucleotides may be designed to 
specifically hybridise to PoEV nucleic acid. They may be 
synthesised, by known techniques and used as primers in PCR or 
sequencing reactions or as probes in hybridisations designed to 
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detect the presence of PoEV material in a sample. The 
oligonucleotides may be labelled by suitable labels known in the 
art, such as, radioactive labels, chemi luminescent labels or 
fluorescent labels and the like. Thus, the present invention 
also provides PoEV specific oligonucleotide probes and primers • 

The term "oligonucleotide" is not meant to indicate any 
particular length of sequence and encompasses nucleotides of 
preferably at least 10b (e.g. 10b to Ikb) in length, more 
preferably 12b-500b in length and most preferably 15b to 100b. 

The PoEV specific oligonucleotides may be determined from 
the PoEV sequences shown in Figure 1 and may be manufactured 
according to known techniques. They may have substantial 
sequence identity (e.g. at least 50%, at least 75%, at least 90% 
or at least 95% sequence identity) with one of the strands shown 
therein or an RNA equivalent, or with a part of such a strand. 
Preferably such a part is at least 10, at least 30, at least 50 
or at least 2 00 bases long. It may be an ORF or a part thereof. 

Oligonucleotides which are generally greater than 30 bases 
in length should preferably remain hybridised to a sample 
polynucleotide under one or more of the stringent conditions 
mentioned above. Oligonucleotides which are generally less than 
30 bases in length should also preferably remain hybridised to 
a sample polynucleotide but under different conditions of high 
stringency. Typically the melting temperature of an 

oligonucleotide less than 30 bases may be calculated according 
to the formula of; 2*^0 for every A or T, plus 4^C for every G or 
C, minus 5"C. Hybridisation may take place at or around the 
calculated melting temperature for any particular 
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Oligonucleotide, in 6 x SSC and 1% SDS. Non specifically 
hybridised oligonucleotides may then be removed by stringent 
washing, for example in 3 x SSC and 0.1% SDS at the same 
temperature. Only substantially similar matched sequences remain 
hybridised i.e. said oligonucleotide and corresponding PoEV 
nucleic acid. 

When oligonucleotides of generally less than 3 0 bases in 
length are used in sequencing and/ or PGR studies, the melting 
temperature may be calculated in the same manner as described 
above. The oligonucleotide may then be allowed to anneal or 
hybridise at a temperature around the oligonucleotides calculated 
melting temperature. In the case of PGR studies the annealing 
temperature should be around the lower of the calculated melting 
temperatures for the two priming oligonucleotides. It is to be 
appreciated that the conditions and melting temperature 
calculations are provided by way of example only and are not 
intended to be limiting- It is possible through the experience 
of the experimenter to vary the conditions of hybridisation and 
thus anneal/hybridise oligonucleotides at temperatures above 
their calculated melting temperature. Indeed this can be 
desirable in preventing so-called non-specific hybridisation from 
occurring. 

It is possible when conducting PGR studies to predict an 
expected size or sizes of PGR product (s) obtainable using an 
appropriate combination of two or more PoEV oligonucleotides, 
based on where they would hybridise to the sequence in Figure 1. 
If, on conducting such a PGR on a sample of PoEV DNA, a fragment 
of the predicted size is obtained, then this is predictive that 
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the DNA is PoEV. 

The present invention also encompasses PoEV detection kits 
including at least one oligonucleotide which is PoEV specific, 
as well as any necessary reaction reagents, washing reagents, 
detection reagents, signal producing agents and the like for use 
in the test formats outlined above. 

In a further aspect there is also provided use of a PoEV 
specific polynucleotide in the detection of PoEV in a sample - 

In a yet further aspect there is provided use of a PoEV 
specific polynucleotide in a PCR for the detection of PoEV in a 
sample. 

The skilled addressee will appreciate how polynucleotide 
fragments may be designed and used as primers/probes in 
polymerase chain reaction (PCR) experiments or Southern analysis 
(i.e. hybridisation studies) for detecting the presence or 
otherwise of PoEV polynucleotide in the nucleic acid of pigs or 
in cell, tissue or organ samples taken from pigs (e.g. from 
potential transplant organs such as liver, kidney and heart) . 
Such cells, tissues or organs can be derived from transgenic 
animals produced as described in EP-A-0493852 , or by other means 
known in the art. Thus the cells, tissues or organs of 
transgenic pigs can be associated with one or more homologous 
complement restriction factors active in humans to prevent/reduce 
activation of complement. 

Furthermore the polynucleotide fragments of the present 
invention can be used to analyze the genetic organisation of 
endogenous PoEV located in the animal cell genome in pigs thus 
permitting the screening of herds of pigs for altered provirus 
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and genomic loci (e.g. non-expressed provirus loci) . Such a 
screening method would facilitate, for example, screening in a 
population of animals which are bred to lack expressed provirus 
and genomic loci and/or loci that do not encode infectious virus 
particles. 

Reagents may also be developed from said polynucleotide 
fragments as aids to develop pigs that do not express an 
infectious, PoEV capable of infecting humans. Such pigs could 
still contain partial defective genomes that could result in the 
expression of non-infectious particles, viral proteins or viral 
mRNA. Alternatively, it may be possible to use constructs 
derived from the PoEV polynucleotide sequence to act as 
insertional mutagens to knockout the productive infectious PoEV 
in embryos, embryonic stem cells, or cells containing 
totipotential nuclei capable of forming a viable embryo. Thus 
gag, pol and/or env gene "knockouts" may be constructed to allow 
development of breeding programmes in pigs whereby endogenous 
PoEV is substantially prevented or reduced. For example the 
nucleotide sequence of PoEV can be manipulated e.g. by deletion 
of a coding sequence in vitro and the resulting construct used 
to replace the natural PoEV sequence by recombination. Thus, the 
proviral genome can be rendered inactive in the porcine cells. 
The knockouts can be manipulated into embryos and/or stem cells 
and if required manipulated nuclei can be transferred from target 
cells to germ cells using micromanipulation techniques well known 
in the art. The invention also extends to animals derived from 
such germ cells. 

Thus, transgenic pigs may be produced containing anti-sense 
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constructs and/or ribozyme constructs capable of downregulating 
the expression of viral proteins, or transgenic pigs expressing 
a single chain immunoglobulin molecule with specificity for PoEV 
proteins or other protein that might interfere with protein 
synthesis or viral assembly may also be produced. Similar 
transgenes encoding trans-dominant negative regulators of PoEV 
expression or transgenes encoding competative defective "genomic 
RNAs" may be used to reduce or eliminate the production of 
infectious virions. The generation of reagents to suppress the 
expression of native PoEV loci in pigs, such as, by generation 
of antisense nucleic acids (e.g. antisense mRNAs) , ribozymes or 
other antiviral reagents may also be developed. 

The polynucleotide fragment can be molecularly cloned into 
a prokaryotic or eukaryotic expression vector using standard 
techniques and administered to a host. The expression vector is 
taken up by cells and the polynucleotide fragment of interest 
expressed, producing protein. Presentation of the protein on 
cell surface stimulates the host immune system to produce 
antibodies immunoreactive with said protein as part of a defence 
mechanism. Thus, expressed protein may be used as a vaccine. 

Inactivated vaccines can be produced from PoEV's or cells 
releasing PoEV. Such infected cells may be generated by natural 
infection or by transfection of a proviral clone of PoEV. it 
will be understood that a proviral clone is a molecular clone 
encoding on at least one antigenic polypeptide of PoEV. After 
harvesting the virus and/or the infected cells, viruses or 
infected cells present can be inactivated for example, with 
formaldehyde, gluteraldehyde, acetylethylenimine or other 
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suitable agent: or process to generate an inactivated vaccine 
using methods commonly employed in the art, (CVMP Working Party 
on Immunological Veterinary Medicinal Products (1993). General 
requirements for the production and control of inactivated 
mammalian bacterial and viral vaccines for veterinary use) • Sub 
unit vaccines may be prepared from the individual proteins 
encoded by the gag, pol and env genes. Typically a vaccine would 
contain env gene products either alone or in combination with gag 
genes produced by expression in bacteria, yeast or mammlian cell 
systems . 

Proviral clones of PoEV can be engineered to develop single 
cycle or replication defective viral vectors suitable for 
vaccination using techniques- Such viral vectors known in the 
art (e.g. MuLV Murine Leukaemia Retrovirus, Adenovirus and 
Herpesviruses (Anderson WF. (1992) . Human Gene Therapy. Science 
256, 808-813) may have one or more genes essential for 
replication deleted, with the missing gene function expressed 
constitutively or conditionally from a further, different 
construct which is integrated into the chromosomal DNA of a 
complementing cell line to the proviral PoEV clone. PoEV virions 
released from the cell line may infect secondary target cells in 
the vaccinee but not produce further infectious virus particles. 
For instance, the polynucleotide sequence encoding the reverse 
transcriptase domain of pol can be deleted from the proviral PoEV 
clone and the reverse transcriptase domain of poi integrated into 
the complementing cell line. 

It will be understood that the polynucleotides; 
polypeptides; PoEV free cells, tissues and/ or organs encompassed 
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by the present invention could be used in therapy, diagnosis, 
and/or methods of treatment. The polynucleotides; polypeptides; 
PoEV free cells, tissues and/or organs encompassed by the present 
invention can also be used in the preparation of medicaments for 
use in therapy or diagnosis. 

The cloning and expression of a recombinant PoEV 
polynucleotide fragment also facilitates in producing anti-PoEV 
antibodies and fragments thereof (particularly monoclonal 
antibodies) and evaluation of in vitro and in vivo biological 
activity of recombinant PoEV polymerase and/or envelope 
polypeptides. The antibodies may be employed in diagnostic tests 
for native PoEV virus. 

It will be understood that for the particular PoEV 
polypeptides embraced herein, natural variations can exist 
between individuals or between members of the family Suidae (i.e. 
the pig family) . These variations may be demonstrated by (an) 
amino acid dif f erence(s) in the overall sequence or by deletions, 
substitutions, insertions, inversions or additions of (an) amino 
acid{s) in said sequence. All such derivatives showing active 
polymerase and/or envelope polypeptide physiological and/or 
immunological activity are included within the scope of the 
invention. For example, for the purpose of the present invention 
conservative replacements may be made between amino acids within 
the following groups: 

(I) Alanine, serine, threonine; 

(II) Glutamic acid and aspartic acid; 

(III) Arginine and leucine; 

(IV) Asparagine and glutamine; 
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(V) 



Isoleuclne, leucine and valine; 



(VI) Phenylalanine, tyrosine and tryptophan 



Moreover, recombinant DNA technology may be used to prepare 



nucleic acid sequences encoding the various derivatives outlined 
above • 

As is well known in the art, the degeneracy of the genetic 
code permits substitution of bases in a codon resulting in a 
different codon which is still capable of coding for the same 
amino acid, e.g. the codon for amino acid glutamic acid is both 
GAT and GAA. Consequently, it is clear that for the expression 
of polypeptides with the amino acid sequences shown in Figure 1 
or fragments thereof, use can be made of a derivative nucleic 
acid sequence with such an alternative codon composition 
different from the nucleic acid sequence shown in said Figure 1. 

Furthermore, fragments derived from the PoEV core, 
polymerase and/or envelope polypeptides as depicted in Figure 3, 
which still display PoEV virus core polypeptide, polymerase 
and/or envelope polypeptide properties, or fragments derived from 
the nucleic acid sequence encoding the virus core polypeptides, 
polymerase and/or envelope polypeptides or derived from the 
nucleotide sequence depicted in Figures 1,2,3 and/or 4encoding 
fragments of said virus core polypeptide, polymerase and/or 
envelope polypeptides are also included of the present invention. 
Naturally, the skilled addressee will appreciate within the ambit 
that the said fragments should substantially retain the 
physiological and/or immunological properties of the GAG, POL 
and/or ENV polypeptides. 

The PoEV polynucleotide fragment of the present invention 
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is preferably linked to regulatory control sequences. Such 
control sequences may comprise promoters, operators, inducers, 
enhancers, ribosome binding sites, terminators etc. Suitable 
control sequences for a given host may be selected by those of 
ordinary skill in the art. 

A polynucleotide fragment according to the present invention 
can be ligated to various expression controlling sequences, 
resulting in a so called recombinant nucleic acid molecule. 
Thus, the present invention also includes an expression vector 
containing an expressible PoEV nucleic acid molecule. The 
recombinant PoEV nucleic acid molecule can then be used for the 
transformation of a suitable host. Such hybrid molecules are 
preferably derived from, for example, plasmids or from nucleic 
acid sequences present in bacteriophages or viruses and are 
termed vector molecules. 

Specific vectors which can be used to clone nucleic acid 
sequences according to the invention are known in the art (e.g. 
Rodriguez, R.L. and Denhadt, D.T. , Edit., Vectors: a survey of 
molecular cloning vectors and their uses, Butterworths, 1988) . 

The methods to be used for the construction of a recombinant 
nucleic acid molecule according to the invention are known to 
those of ordinary skill in the art and are inter alia set forth 
in Sambrook, et al . (Molecular Cloning: a laboratory manual Cold 
Spring Harbour Laboratory, 1989) . 

The present invention also relates to a transformed cell 
containing the PoEV polynucleotide fragment in an expressible 
form. "Transformation", as used herein, refers to the 
introduction of a heterologous polynucleotide fragment into a 
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host cell. The method used may be any known in the art, for 
example, direct uptake, transfection transduction or electro 
poration (Current Protocols in Molecular Biology, 1995. John 
Wiley and Sons Inc) . The heterologous polynucleotide fragment 
may be maintained through autonomous replication or 
alternatively, may be integrated into the host genome. The 
recombinant nucleic acid molecules preferably are provided with 
appropriate control sequences compatible with the designated host 
which can regulate the expression of the inserted polynucleotide 
fragment, e.g. tetracycline responsive promoter, thymidine kinase 
promoter, SV-4 0 promoter and the like. 

Suitable hosts for the expression of recombinant nucleic 
acid molecules may be prokaryotic or eukaryotic in origin. Hosts 
suitable for the expression of recombinant nucleic acid molecules 
may be selected from bacteria, yeast, insect cells and mammalian 
cells. 

Since the biological half life and the degree of 
glycosylation of recombinant PoEV virus core polypeptide, 
polymerase and/ or envelope polypeptides may be important for use 
in vivo, yeast and baculovirus systems, in which a greater degree 
of processing and glycosylation occur, are preferred. The yeast 
strain Pichia Pastoris exhibits potential for high level 
expression of recombinant proteins (Clare et al., 1991). The 
baculovirus system has been used successfully in the production 
of type 1 interferons (Smith et al., 1983). 

Embodiments of aspects of the present invention will now be 
described by way of example only which are not intended to be 
limiting thereof. 
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Examples Section 

Example 1 

Preparation of viral RNA 

SOOml of supernatant derived from exponentially growing porcine 
kidney cells {PK-15, American Type Culture Collection CCL 33) was 
clarified by centrif ugation of approximately ll,000xg for 10 
minutes* Virus was pelleted from the clarified supernatant by 
centrif ugation at approximately lOO^OOOxg for 60 minutes. The 
supernatant was discarded and the viral pellet retained for the 
preparation of viral RNA genomes. RNA was prepared from the 
virus pellet using a Dynabeads (registered trade mark) mRNA 
Direct kit according to the manufacturer's protocols; A PoEV 
virus pellet was resuspended in 500^1 of TNE (lOmM Tris HCl 
pHd.O, O.IM NaCl^lmM EDTA) and the virions disrupted by the 
addition of 2ml of lysis/binding buffer. Dynabeads 01igo(dT)25 
were conditioned according to the manufacturer's instructions and 
added to the virus disrupted solution. Viral RNA was allowed to 
bind to the Dynabead for 10 minutes before the supernatant was 
removed and the bound RNA was washed three times with washing 
buffer with LiDS (0.5ml) and twice with washing buffer alone. 
The RNA was finally resuspended in 25 /xl of elution solution. 
All procedures were performed at ambient temperature. RNase 
contamination was avoided by the wearing of gloves, observation 
of sterile technique and treatment of solutions and non- 
disposable glass and plasticware with diethyl pyrocarbonate 
(DEPC) . The RNA was resuspended in DEPC- treated sterile water. 
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Example 2 
Synthesis of cDNA 

cDNA was synthesised from the purified genomic RNA using Great 
Lengths ™ cDNA amplification reverse transcriptase reagents 
(Clontech Laboratories Inc.) following the manufacturer's 
instructions. The RNA was primed with both oligo(dT) and random 
hexamers to maximise synthesis* 

The Great Lengths cDNA synthesis protocol is based on a modified 
Gubler and Hoffman (1983) protocol for generating complementary 
DNA libraries and essentially consists of first-strand synthesis, 
second strand synthesis, adaptor ligation, and size 
f ractionact ion . 

First strand synthesis: lock-docking primers anneal to the 
beginning of the poly-A tail of the RNA due to the presence of 
A, C or a residue at the 3*-end of the primer. This increases 
the efficiency of cDNA synthesis of eliminating unnecessary 
reverse transcription of long stretches of poly-A. In addition, 
the reverse transcriptase used is MMLV (RNase H) which gives 
consistently better yields than do wild-type MMLV or AMV reverse 
transcriptase. 

Second strand synthesis: the ratio of DNA polymerase I for 
RNase H has been optimised to increase the efficiency of the 
second strand synthesis and to minimize priming by hair pin loop 
formation. Following secoond-strand synthesis, the ds cDNA is 
treated with T4 DNA polymerase to create blunt ends. 
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Adaptor ligation: the cDNA is ligated to a specially 
designed adaptor that has a pre-existing EcoRI "sticky end". The 
use of this adaptor, instead of a linker, eliminates the need to 
methylate and the EcoRI - digest the cDNA, and thus leaves 
internal EcoRI, sites intact. The adaptor is 5'-phosphorylated 
at the blunt end to allow efficient ligation to the blunt-ended 
cDNA. 

Size fractionation: the ds cDNA is phosphorylated at the 
EcoRI sites and size-fractionated to remove unligated adaptors 
and unincorporated nucleotides. The resulting cDNA is ready for 
cloning into a suitable EcoRI-digested vector. 

Example 3 

Molecular cloning of cDNA 

The size fractionated fragment was ligated with EcoR I- digested 
pZErp™ -1 plasmid vector DNA (Invitrogen Corporation, San 
Diego, U.S.). The ligation mix was used to transform competent 
TOPlOF'cells and these were plated onto L-Agar containing zeocin 
following the manufacturer's instructions (Zero Background''''^' 
cloning kit - Invitrogen) . Several of the resulting zeocin 
resistant colonies were amplified in L-Broth containing zeocin 
and the plasmid DNA was purified by alkaline lysis (Maniatis et 
al., 1982). 

The plasmid DNA was digested to completion with the 
endonuclease EcoR I and the resulting DNA fragments were 
separated by electrophoresis through an 1.0% agarose gel 
(Maniatis et al . , 1982) , in order to check that a fragment in the 
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predicted size fractionated size range had been cloned. A clone 
identified as pPoEV was used in further experimentation. 
Example 4 

DHA sequence sn& lysis. 

pPoEV plasmid DNA was purified according to common techniques 
(Sambrook et al, 1989) and sequenced using an ABI automated 
sequencer. Overlapping sequencing primers from both strands of 
the molecular clone were used to determine the nucleotide 
sequence . 

The first sequence obtained is shown in Figure 1. This 
sequence was identified as encoding two ORFs of 924 (nucleotides 
23-2793) and 218 (nucleotides 2642-3297) amino acids, relating 
to the pol and env genes respectively. This sequence was revised 
and updated to the second sequence as shown in Figure 2. This 
second sequence was identified as encoding three ORFs of 516 
(nucleotides 576-2126), 1186 (nucleotides 2143-5733) and 656 
(nucleotides 5606-7576) amino acids, encoding the PoEV gag, pol 
and env genes respectively. This second sequence has since been 
revised and updated to the sequnce shown in Figure 3 . This third 
sequence was identified as encoding three ORFs of 524 
(nucleotides 588-2162), 1194 (nucleotides 2163-5747) and 656 
(nucleotides 5620-7590) amino acids, encoding the PoEV gag, pol 
and env genes respectively. 

The differences in the disclosed seqeunces is reflected by 
improvements in carrying out and analysing the sequence obtained. 
However, there is 100% identity at the nucleic acid level, 
between positions 21-2681 of the first sequence and positions 
2972-5653 of the third sequence. Overall there is a 70.5% 
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identity in the entire 3310 bp of the first sequence with a 
corresponding portion of the third sequence. 

There are only 3 base changes between the second sequence 
and the third sequence. These are as follows: 
base no. ffrom Figure 2) change 
2121 insertion of a "G" 

2157 insertion of a "G" 

5902 "R" to an "A" 

7700 "M" to an "A" 

The changes at base nos. 5902 and 7700 do not effect the 
corresponding amino acid sequence. However, the changes at 
positions 2121 and 2157 alter the amino acid sequence at the end 
of GAG and the begining of POL. For GAG the final amino acid "S" 
have now been replaced by "VLALEEDKD". The total product size 
is now 524 amino acids. For POL, the first five amino acids 
••RLGET" have been deleted and replaced by "GRR" . The total 
product size is now 1194 amino acids. 

Similarities were observed between pPoEV and the majority 
of retroviruses determined by using alogrithims from DNASTAR Inc. 
Lasergene software (DNASTAR) . The similarities were closest with 
gibbon ape leukaemia virus (GaLV) in the polymerase (pol) 
regions of the pro-virus at 68.5%, in the virus core (gag) 
region, 59.2% and in the envelope (^nv) region, 39.3% The 
nucleotide sequence and major ORFs of the pPoEV insert are shown 
in Figure 3. The largest ORF (nucleotides 2163-5747) encodes the 
polymerase polypeptide and the smaller ORFs (nucleotides 588-2162 
and 5620-7590) encode the core and envelope polypeptides 
respectively . 
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Example 5 

Purification of cellular DNA from cultur d c lis, tissues and 
blood. 

Cultured cells 

Cells were maintained in culture and approximately 5 x lO'^ cells 
were harvested for DNA preparation- The cells were pelleted by 
centrifugation resuspended in phosphate-buffered saline, 
re-centrif uged at lOOOg for 2 minutes and the supernatant was 
discarded. 

Porcine tissues 

Porcine tissue samples were frozen in liquid nitrogen and 
powdered by grinding in a mortar or between metal foil. The 
samples were resuspended in 5ml of extraction buffer consisting 
of 0.025M EDTA (pH 8.0), O.OlMTris.Cl pH 8.0, 0.5% SDS 20/ig/ml 
RNAse and lOO/xg/ml proteinase K (Maniatis et al . , 1982). 

Porcine blood 

A buffy coat was prepared from the blood samples. 20ml samples 
were centrifuged at lOOOg for 15 minutes. The buffy coat was 
resuspended in buffer and the samples centrifuged at lOOOg for 
15 minutes. The process was repeated one further time. The 
sample was mixed with 5ml (3x volume) of extraction buffer 
(Maniatis et ai . , 1982). 

Purification 

The samples (i.e. cultured cells, porcine tissue or porcine blood 
cells) in proteinase K-extraction buffer containing 20fig/ml RNAse 
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and lOOftg/ml proteinase K were digested for approximately 24 
hours at 37*'C. The deproteinised DNA was extracted twice with 
phenol and twice with phenol chloroform and finally precipitated 
by ethanol in the presence of ammonium acetate. The DNA was 
recovered by centrif ugation at 3 000g for 3 0 minutes and the 
supernatant discarded (Maniatis et al . , 1982). The pellet was 
washed in 70% ethanol and allowed to air dry for approximately 
1 hour. The DNA was allowed to re-dissolve in Tris EDTA (TE) 
buffer and the purity and concentration of the DNA was assessed 
by spectrophotometry (Maniatis et al * , 1982). 

Example 6 

Southern blot analysis of porcine tissue and cells 

In order to demonstrate that the molecularly cloned DNA 
comprising the insert from PoEV was derived from the PK-15 cell 
line (American Type Culture Collection CCL33) , the DNA was 
hybridised against cellular DMAs and its ability to detect 
proviral DNA was examined. 

DNA purified from pPoEV was radioactively labelled and used to 
probe a Southern blot of endonuclease digested DNAs derived from 
PK-15 cells . 

The DNAs probed were as follows : 

a) Copy number controls of pPoEV DNA linearized by digestion 
with EcoRI. One copy per haploid cell genome was estimated 
to be 6.84pg. The control was present at an estimated copy 
number of 1, 5 and 10 copies. 

b) PK-15 DNA. 

c) Negative control HeLa (American Type Culture Collection 
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CCL2) DNA derived from a human adenocarcinoma cell line 
harbouring human papillomavirus type 18 DNA. 
d) Negative control SP2 0 ( European Collection of Animal Cell 
Cultures 850724 01} DNA derived from a murine myeloma cell 
line harbouring a xenotropic MuLV retrovirus. 

A hybridisation signal was observed in only the PK-15 porcine 
DNA. No signal was detected in either the negative human or 
murine DNAs. The PK-15 DNA contained more than 10 copies per 
cell with an estimated copy number of 20. The sizes of the 
three major EcoRI- endonuclease digested DNA fragments were 
approximately 3.8kb, l.Skb and 0.6kb. The sizes of relevant 
fragments detected in the recombinant pPoEV were comparable. 



There are, as expected, a number of fragments common to the 
genomic DNA of PK-15 and pPoEV DNA and there is agreement 
between the patterns observed in both DNAs digested with Xhol, 
BamHI and Hindlll. However, there are additional fragments 
obtained on digestion of pPoEV DNA by a number of endonucleases . 

pPoEV sequences were also detected in swine testes (American Type 
Culture Collection CRL 1746) and primary porcine kidney cells 
(Central Veterinary Laboratory batch C04495) but not in hamster 
CHOKl (American Type Culture Collection CCL61) or murine NSO 
myeloma cells (European Collection of Animal Cell Cultures 
85110503) . 
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In order to demonstrate that the molecularly cloned DNA 
comprising the insert from pPoEV could detect sequences in 
porcine cells and tissues in addition to PK-15 the pPoEV DNA was 
hybridised against cellular DNA from tissues derived from pigs 
and its ability to detect proviral DNA was examined (Maniatis et 
al., 1982)- 

The DNA purified from pPoEV was radioactively labelled and used 
to probe a Southern blot of endonuclease digested DNAs derived 
from pig organs including liver, kidney, heart and blood. 

The DNAs probed were as follows : 

a) Copy number controls of pPoEV DNA linearized by digestion 
with EcoRI . One copy per haploid cell genome was estimated 
to be 6.84pg. The control was present at an estimated copy 
number of 5,10, 2 0 and 50 copies, 

b) DNA purified from the porcine tissues digested with 
EcoRI . 

A hybridisation signal was observed in all the porcine DNAs. 

The DNAs contained less than 5 copies per cell. There were 
approximately eight distinct bands in each DNA. The sizes of 
the three major endonuclease digested DNA fragments were 
approximately 3.8kb, 1.8kb and 0.6kb. 
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Example 7 

PolYBKsrsise C^siis^ Reaction (PGR) Amplif icatioas 
Oligonucleotides were selected from the PoEV genome. 

The upstream primer was 5'-GGA AGT GGA CTT CAC TGA G-3'. 

The downstream primer was 5' -CTT TCC ACC CCG AAT CGG -3'. 

The PGR was performed as described by Saiki et al (1987) . One 
1/il of lOOngZ/xl template DNA was added to a ASfxl reaction mixture 
containing 200/xM of dATP, dCTP, dGTP, dTTP, 30pmol of both 
primers from the pair described above, lunit of DNA polymerase 
and 5^1 of reaction buffer. The reaction buffer contained 200mM 
Tris-HCl pH 8.4, 500mM potassium chloride and 15mM magnesium 
chloride, ultrapure water. The solution was overlaid with two 
drops of mineral oil to prevent evaporation. Thirty five cycles 
of amplification were performed using a Perkin Elmer Cetus 
thermal cycler. Each cycle consisted of 1 minute, at 95^0 to 
denature the DNA, 1 minute • at Sl^'C to anneal the primers to the 
template and 1 minute, at 72**C for primer extension. After the 
last cycle- a further incubation for 10 minutes, at 72**C was 
performed to allow extension of any partially completed product. 
On completion of the amplification, 10/xl of the reaction mixture 
was electrophoresed through a 5 per cent acrylamide gel. The DNA 
was visualised by staining with ethidium bromide and exposure to 
ultraviolet light (320nm) . 
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The PGR reaction amplified a sequence of approximately 787bp 
from pPoEV and from porcine cells as expected indicating that the 
assay detected the PoEV proviral DNA. There was no specific 
amplification of the expected sequence in cells of non-porcine 
origin and therefore, the PGR reaction and recombinant clone can 
be used as a specific and sensitive diagnostic tool for detection 
of PoEV. 

Two further digonucleotides were designed against the 3 'end 
of the pol gene and s' end of the gag gene respectively. 

The 3' pol oligionucleotide was 5' -GAT GGC TCT OCT GCC CTT TG-3' 

The 5' gag oligionucleotide was 5'-CGA TGG AGG CGA AGC TTA AGG-3' 

The above oligionucleotide were also used in in PGR reactions 
according to the conditions described above, with the exceptions 
that the annealing temperature was 58° and 30 cycles of 
replication were carried out. The PGR reaction amplified a 
sequence of approximately 468bp from pPoEV and from porine cells. 

Example 8 

Production of PoEV polypeptide in Escherichia coll. 
The open reading frame (ORF) encoding the pol peptide was 
isolated from the pPoEV clone and molecularly cloned into the 
plasmid pGEX-4T-l (Pharmacia Ltd.) for expression. 

Two ml cultures of E. coli transformed with various expression 
constructs were grown with shaking at 37*'C to late log phase 
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(O.D-<ioonni of 0-6) and induced by the addition of IPTG to 0,1 mM. 
Induced cultures were then incubated for a further 2 hours after 
which the bacteria were collected by centrif ugation. The 
bacterial pellet was lysed by boiling in SDS-PAGE sample buffer 
and the protein profile of the induced bacteria was analysed on 
a 12% acrylamide gel (Laemmli, 1970) followed by staining with 
coomassie brilliant blue dye. 

Example 9 

Isolation and partial sequencing of Raii clone 

The aim of the study was to determine whether the human cell line 
"Raji" was susceptible to infection with the PoEV present in 
porcine kidney cells (PK15) • In order to test the capacity of the 
virus for xenotropism, PK15 cells were co-cultured with the B 
lymphoblastoid (Raji) cell line over 20 passages. 

The culture system utilised direct culture and transwells, which 
separated the human and porcine cells, but permitted viruses to 
pass through the separating membrane. After every fifth passage, 
supernatants from the human cell lines are tested for the 
presence of retrovirus by reverse transcriptase assay. 

Cell cultures 

Porcine kidney (PK15) cells (ATCC CCL 33) were used as the source 
of PoEV. The human cells used for co-cultivation with PK15 cells 
were the lymphoblast-like Burkitts lymphoma Raji (ATCC CCL 86) 
cell line. This cell line does not harbour endogenous 
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retroviruses and lacks reverse transcriptase activity when tested 
by the present inventors. 

Co-cultivation 



Raji cells were co-cultivated directly with PK15 cells in 
duplicate SOcm^ flasks and exposed to the PK15 cells throughout 
the 20 passage culture period. The cells were passaged twice per 
week and PK15 cells added as necessary from a stock culture. At 
every fifth passage a sample of Raji cells was removed from the 
co-culture, washed and cultured for 3-4 days. Supernatant was 
then harvested and tested for presence of retrovirus by reverse 
transcriptase (RT) assay. 

RESULTS 

The presence of reverse transcriptase activity with a preference 
for the Mn-* cation in the supernatant from detector cell 
cultures is suggestive of infection by porcine retrovirus. 
Reverse transcriptase activity with preference for the Mn^"*" 
template was not detected in the duplicate co-cultivated test 
cultures at passage 5 but was detected at passages 10, 15 and 20. 
No significant RT activity was detected in the negative control 
cultures. RT activity with preference for the Mn^"^ template was 
detected in positive control cultures at passage 5 and 20. 
An infected raji culture was diluted to single cells, and then 
a selection of cells cultured separately such that each culture 
originated from one cell. Each culture was tested by reverse- 
transcriptase assay . 
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Genomic DNA was made from an RT-positive clone as described in 
example 5 -purification. The PoEV ENV region was amplified by PGR 
as described below and the product molecularly cloned into pMOS 
blue T-vector (Amersham) • This molecular clone was then sequenced 
(Fig- 4) . 

PGR 

Oligonucleotides were selected from the PoEV genome. 

The upstream primer was 5' -GAT GGC TCT CCT GCC CTT TG -3' 
5' base position: 5240 

The downstream primer was 5' -CCA CAG TCG TAG ACC ACG -3' 
5' base position: 8144 

Expected product size: 2904bp 

Approx. 1 /Ltg of genomic raji clone DNA was added to a 50 m1 
reaction mixture containing 2 00 of dATP, dCTP, dGTP, dTTP, 

30pM each primer detailed above, lu Taq DNA polymerase and 5^1 
reaction buffer. The reaction buffer contained 200mM Tris.Cl pH 
8.4, SOOmM potassium chloride, 15mM magnesium chloride and 
ultrapure water- The solution was overlaid with two drops of 
mineral oil to prevent evaporation. Thirty cycles of 
amplification was performed followed by an elongated extension 
reaction of 60min. at 72''C. 
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The cycles consisted of: 
95^C 1 min- 
se^'C 1 min. 
72**C 2 min. 

The PGR product was visualised as described in example 7 
Product size: -'3Kb* 

CLONING 



The PGR product was molecularly cloned into pMOS-Blue T-vector 
as directed by the manufacturer (pMOS-Blue T-vector kit - 
Amersham) • 

20 transformed colonies (clones) were picked and added to 5mls 
L-broth containing 50 ^g/ml ampicillin. The cultures were grown 
shaking at 37**G overnight, Plasmid DNA was isolated from each 
clone using the perfect prep plasmid isolation kit as directed 
by the manufacturer (5 Prime - 3 Prime Inc. Boulder, CO, USA). 

Plasmid DNA was digested to completion with the endonucleases 
EcoRI and Hindlll and the products visualised on an ethidium 
bromide-stained 1% agarose gel. A clone (raji env clone) showing 
the same banding pattern as that predicted for 'PK15 cell line 
derived PoEV , was selected for sequencing. 

SEQUENCING 

Raji env clone plasmid DNA prepared above was sequenced using an 
ABI automated sequencer, and the commercially availableT7 
sequencing primer. 
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The entire eiiv gene region of the "Raji" was sequnced (see Figure 
4) and discovered to have substantial sequence identity at both 
the nucleic acid and amino acid levels (98.9% and 96.3% 
respectively) with the PoEV sequence from PK-15. 

Example 10 

Phyloaenetic analysis 

Phylogenetic analysis was performed using the PHYLIP package. 
Sequence distances were calculated using the PROTDIST program 
(Dayhoff matrix) and a neighbour- joining unrooted phylogenetic 
tree reconstructed using the NEIGHBOUR program. 

Bootstrapping was performed using 200 replicates of the pol 
alignment, created using the SEQBOOT program and a consensus tree 
was obtained using the CONSENSE program (see Figure J) . The 
bootstrap percentages are indicated at the branch fork, with 
missing values equal to 100%. The data indicate that PoEV is 
closely related to but distinct from the type-C oncovirus 
typified by gibbon, murine and feline leukaemia viruses. 
A phylogenetic tree was constructed from the pol alignment using 
the maximum likliehood algorithm (Dayhoff matrix) . This tree 
differed from the pol NJ tree only in the placement of the BaEV 
lineage in relation to other mammalian type C viruses and 
corresponded to a low bootstrap for the BaEV fork observed in the 
NJ tree. 
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gxample H 

Analysis of the LTR and adjacent region 

The long terminal repeat (LTR) is a reiterated sequence at each 
end of the provirus that contains the enhancer and promoter 
governing transcription of the provirus as well as sequences 
required for reverse transcription of the RNA genome and 
integration of the proviral DNA. Three recognised domains of the 
LTR are identifiable, U3 , R and U5 with the LTR being delineated 
by inverse repeats AATGAAAGG and CCTTTCATT at the 5 ' and 3 ' ends 
of U3 and US respectively. 



LTR Domain PoEV Genome Sequence Length bp 

in accordance with Figure 3 
U3 7638-8106 469 

R* 8107-8188,1-61 82 

US 62-143 82 



*The position of the R is defined here by similarity to the 3 'end 
of the MuLV LTR and is compatible with the observed location of 
a cap site approximatelty 24 bp downstream of the TATA box. 



The U3 region contans multiple potential transcription sites as 
shown in Figure 6. Most of the U3 region shows little or no 
homology to other mammalian type-C retroviruses which show 
conserved sites or repeat elements. However, there is homology 
to other mammaliann type-C viruses towards the 3 'end of the U3 
& region and into R and U5 . Amongst the potential transcription 
factor sites are those for the following: 
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LyF-1 is a transcriptional regulator that interacts with a novel 
class of promoters for lymphocyte-specific-genes (Lo et al 1991) • 



E47 is the prototype member of a new family of tissue specific 
enhancer proteins that have been shown to bind to the enhancer 
of murine leukaemia virus. 

ETS-1 is a transcription factor primarily expressed in the 
haematopoietic 1 ineage . 

The LTR contains direct repeats at 80006-8062 and 8045-8101 which 
together contain three potential CCAATT boxes. A potential TATA 
box is located at position 8129-8144. 

The R region contains a PADS (Poly A downstream element) and 
consensus polyadenylation signal (AATAAA) . 

The primer binding site (PBS) of PoEV is glycine (2) tRNA which 
has not reported for any exogenous retrovirus. 
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Claims 



1. An isolat.ed polynucleotide fragment: 

(a) encoding at least one porcine retrovirus (PoEV) 
expression product; 

(b) encoding a physiologically active and/or immunogenic 
derivative of said expression product; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 

2. An isolated polynucleotide fragment according to claim 1: 

(a) encoding the polymerase (POL) polypeptide; 

(b) encoding a physiologically active and/or immunogenic 
derivative of a polypeptide as described in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 



3. An isolated polynucleotide fragment according to claim 1: 

(a) encoding the virion core polypeptide (GAG) and/or 
envelope polypeptide (ENV) ; 

(b) encoding a physiologically active and/or immunogenic 
derivative of a polypeptide as described in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 



defined in (a) or (b) . 
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4. An isolated polynucleotide fragment according to claim i: 

(a) encoding the virion core polypeptide (GAG) , polymerase 
(POL) and envelope polypeptide (ENV) ; 

(b) encoding a physiologically active and/or immunogenic 
derivative of a polypeptide as described in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 



An isolated polynucleotide fragment according to any one 
claims 1 to 4 wherein the polynucleotide fragments is 
deoxyribose nucleic acid (DNA) fragment. 

An isolated polynucleotide fragment according to a 
preceding claim encoding: 



(a) said at least one polypeptide having an amino acid 
sequence which is shown in Figures 3 or 4 ; 

(b) encoding a polypeptide which is a physiologically 
active and/or immunogenic derivative of at least one 
of the polypeptides defined in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 



An isolated polynucleotide 
preceding claim; 
(a) comprising at least one 
1,2^3 or 4 or comprising 



fragment according to any 

of the ORFs shown in Figures 
a corresponding RNA sequence; 
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(b) comprising a sequence having substantial nucleotide 
sequence identity with a sequence as described in (a) 
above ; or 

(c) comprising a sequence which is complementary to a 
sequence as described in (a) or (b) above. 

8. A recombinant nucleic acid molecule comprising a 
polynucleotide fragment according to any one of claims 1 to 
7. 

9. A recombinant nucleic acid molecule according to claim 8 
wherein the recombinant nucleic acid molecule comprises 
regulatory control sequences operably linked to said 
polynucleotide fragment for controlling expression of said 
polynucleotide fragment. 

10. A vector comprising a recombinant nucleic acid molecule 
according to either of claims 8 or 9 . 

11- A vector according to claim 10 which is a virus or a 
plasmid. 

12. A prokaryotic or eukaryotic host cell transformed by a 
polynucleotide fragment, recombinant nucleic acid molecule, 
or vector according to any of claims 1 to 11. 
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13. A recombinant PoEV polypeptide or derivative thereof 

displaying POL PoEV physiological and/or iiamunogenic 
activity. 



14 . A recombinant PoEV polypeptide or derivative thereof 
displaying GAG and/or ENV PoEV physiological and/or 
immunogenic activity. 



15, A recombinant PoEV polypeptide or derivative thereof 
displaying GAG, POL and ENV PoEV physiological and/or 
immunogenic activity . 



16. A recombinant PoEV polypeptide according to any one of 
claims 13 to 15 comprising a sequence as shown in Figures 
3 or 4 , or functionally active derivative thereof. 



17. A vaccine comprising a recombinant PoEV polypeptide 
according to any one of claims 13 to 16, or an inactivated 
PoEV virus and a pharmaceutical ly acceptable carrier. 



18. An antibody or fragment thereof capable of binding to a 
polypeptide or fragment according to any one of claims 13 
or 16. 



19. A polynucleotide primer which is PoEV specific. 



20. 



A polynucleotide probe which is capable of specifically 
hybridising to a PoEV polynucleotide sequence. 
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21. A probe or a primer according to claims 19 or 20 which has 
substantial nucleotide sequence identity with a strand of 
the molecule depicted in Figures 1,2,3 or 4 or a strand 
complementary therewith, with a corresponding RNA molecule, 
or with a part of such a molecule. 

22. A PoEV detection kit comprising a polynucleotide primer or 
probe according to any of claims 19 to 21. 

23. Use of a PoEV specific polynucleotide in the detection of 
PoEV in a sample. 

24. Use of a PoEV specific polynucleotide in a PGR for the 
detection of PoEV in a sample. 

25. A pig modified so as to not express an infectious PoEV 
capable of infecting humans. 

26. Cells, tissues or organs obtainable from a pig accoding to 
claim 25. 

27. Use of a recombinant PoEV polypeptide according to any one 
of claims 13 to 16 in the preparation of a vaccine. 

28 • Use of a polynucleotide primer or probe according to any of 
claims 19 to 21 in the preparation of a detection kit 
capable of detecting the presence of PoEV nucleic acid in 
a sample. 
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29- Use of a polynucleotide; polypeptide; cells, tissues or 
organs according to any one of claims 1 to 7, 13 to 16 or 
26 in therapy or diagnosis. 

30. A polynucleotide; polypeptide; cells, tissues or organs 
according to any one of claims 1 to 7, 13 to 16 or 2 6 in 
the preparation of a medicament for use in therapy or 
diagnosis. 

31. The invention substantially as hereinbefore described. 
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• « • • • • 

1 GAATTCGCGGCCGCGTCGACAGATGCCTTCTTCTGCCTGAGATTACACCCCACTAGCCAA 60 

• • • . • • • 

6 1 CCACTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACCGGGCAGCTCACCTGG 120 

12 1 ACCCGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGACGAAGCCCTACACAGG 180 

• , • • • • 

181 GACCTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCTCCAGTACGTGGATGAC 24 0 

241 CTGCTTCTGGCGGGAGCCACCAAACAGGACTGCTTAGAAGGTACGAAGGCACTACTGCTG 300 

301 GAATTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGATTTGCAGGAGAGAG 360 

361 GTAACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGGAGGCACGGAAG 42 0 

421 AAAACTGTAGTCCAGATACCGGCCCCAACCACAGCCA^ACA^GTGAGAGAGTTTTTGGGG 4 80 

4 81 ACAGCTGGATTTTGCAGACTGTGGATCCCGGGGTTTGCGACCTTAGCAGCCCCACTCTAC 54 0 

541 CCGCTAACCAAAGAAPlAAGGGGGATTCTCCTGGGCTCCTGAGCACCAGAAGGCATTTGAT 600 

601 GCTATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCCCTCCCTGACGTAACTAA'^CCC 660 

6 61 TTTACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGTTTTAACCCAAACCCTA 720 

721 GGACCATGGAGGAGACCTGTTGCCTACCTGTCAAAGAAGCTTGATCCTGTAGCCAGTGGT 7 8 0 

781 TGGCCCGTATGTCTGAAGGCTATCGCAGCTGTGGCCATACTGGTC^AGGACGCTGACAAA 8 4 0 

8 41 TTGACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCATTGGAGAACATCGTTCGG 900 

901 CAGCCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAAGCCTGCTTCTC 960 

961 ACAGAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTCTTCTGCCTGAA 1020 

1021 GAGACTGATGAACCAGTGACTCATGATTGCCATCAACTATTGATTGAGGAGACTGGGGTC 108 0 
1081 CGCAAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGTTCACTGACGGA 114 0 
1141 AGCAGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGGACGGGACCCGC 1200 
1201 ACGATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGCTGAGCTCATGGCC 1260 

SUBSTITUTE^HEET (RULE 26) 
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^ ..... 

1261 CTCACGCAAGCTTTGCGGCTGGCCGAAGGGAAATCCATAAACATTTATACGGACAGCAGG 132 0 

1321 TATGCCTTTGCGACTGCACACGTACACGGGGCCATCTATAAACAAAGGGGGTTGCTTACC 138 0 

1381 TCAGCAGGGAGGGAAATAAAGAACAAAGAGGAAATTCTAAGCCTATTAGAAGCCTTACAT 144 0 

• • • • • • 

1441 TTGCCAAAAAGGCTAGCTATTATACACTGTCCTGGACATCAGAAAGCCAAAGATCTCATA 1500 

1501 TCTAGAGGGAACCAGATGGCTGACCGGGTTGCCAAGCAGGCAGCCCAGGCTGTTAACCTT 1560 

. ... - • 

1561 CTGCCT ATAATAGAAACGCCCAAAGCCCCAGAACCC AGACGACAGTACACCCT AGAAGAC 1620 

1 62 1 TGGCAAGAGATAAAAAAGATAGACCAGTTCTCTGAGACTCCGGAGGGGACCTGCT ATACC 168 0 

1681 TCATATGGGAAGGAAATCCTGCCCCACAAAGAAGGGTTAGAATATGTCCAACAGATACAT 17 4 0 

• • . • • • 

1741 CGTCTAACCCACCTAGGAACTAAACACCTGCAGCAGTTGGTCAGAACATCCCCTTATCAT 18 00 

1801 6TTC7GAGGCTACCAGGAGTGGCTGACTCGGTGGTCAAACATTGTGTGCCCTGCCAGCTG 1860 

1861 GTTAATGCTAATCCTTCCAGAATACCTCCAGGAAAGAGACTAAGGGGAAGCCACCCAGGC 1920 

• . . • • 

1921 GCTCACTGGGAAGTGGACTTCACTGAGGTAAAGCCGGCTAAATACGGAAACAAATATCTA 1980 

1981 TTGGTTTTTGTAGACACCTTTTCAGGATGGGTAGAGGCTTATCCTACTAAGAAAGAGACT 204 0 

2041 TCAACCGTGGTGGCTAAGAAAATACTGGAGGAAATTTTTCCAAGATTTGGAATACCTAAG 210 0 

2101 GTAATAGGGTCAGACAATGGTCCAGCTTTCGTTGCCCAGGTAAGTCAGGGACTGGCCAAG 2160 

• , • • • • 

2161 ATATTGGGGATTGATTGGAAACTGCATTGTGCATACAGACCCCAAAGCTCAGGACAGGTA 2220 

2221 GAGAGGATGAATAGAACCA7TAAAGAGACCCTTACCAAATTGACCACAGAGACTGGCATT 22 80 

2281 AATGATTGGATGGCTCTCCTGCCCTTTGTGCTTTTTAGGGTGAGGAACACCCCTGGACAG 2 3 4 0 

2341 TTTGGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCGTTGGCAGAAATTGCC 24 00 

24 01 TTTGCACATAGTGCTGATGTGCTGCTTTCCCAGCCTTTGTTCTCTAGGCTCAAGGCGCTC 24 60 

24 61 GAGTGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAGGAGGAGACTTG 252 0 

SUBSTITUTE SHEET (RULE 26) 
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2521 CAAGTTCCACATCGCTTCCAAGTTGGAGATTCAGTCTATGTTAGACGCCACCGTGCAGGA 2 5 8 0 

2581 AACCTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACACCAACGGCTGTG 2 64 0 

2641 AAAGTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCCGGCGCCACCTCCC 27 00 

2701 GATTCGGGGTGGAAAGCCGAAAAGACTGAAAATCCCCTTAAGCTTCGCCTCCATCGCGTG 2760 

• • . . , 

27 61 GTTCCTTACTCTGTCAATAACTCCTCAAGTTAATGGTAAACGCCTTGTGGACAGCCCGAA 2 82 0 

2821 CTCCCATAAACCCTTATCTCTCACCTGGTTACTTACTGACTCCGGTACAGGTATTAATAT 28 8 0 

2881 TAACAGCACTCAAGGGGAGGCTCCCTTGGGGACCTGGTGGCCTGAATTATATGTCTGCCT 2 94 0 

2941 TCGATCAGTAATCCCTGGTCTCAATGACCAGGCCACACCCCCCGATGTACTCCGTGCTTA 3000 

3001 CGGGTTTTACGTTTGCCCAGGACCCCCAAATAATGAAGAATATTGTGGAAATCCTCAGGA 3060 

• • • • • • 

3061 TTTCCTTTGCAAGCAATGGAGCTGCATAACTTCTAATGATGGGAATTGGAAATGGCCAGT 3120 

3121 CTCTCAGCAAGACAGAGTAAGTTACTCTTTTGTTAACAATCCTACCAGTTATAATCAATT 3180 

3181 TAATTATGGCCATGGGAGATGGAAAGATTGGCAACAGCGGGTACAAAAAGATGTACGAAA 32 4 0 

3241 TAAGCAAATAAGCTGTCATTCGTTAGACCTAGATTACTTAAAAATAAGTTTCACTAAAAA 3300 

3301 AAAAAAAAAAAAAAAAAAAA 3320 
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1 TGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCTTGCTGTTTGCATCAAGACCGCTTCT 60 

6 1 CGTGAGTGATTTGGGGTGTCGCCTCTTCCGAGCCCGGACGAGGGGGATTGTTCTTTTACT 120 

121 GGCCTTTCATTTGGTGCGTTGGCCGGGAAATCCTGCGACCACCCCTTACACCCGAGAACC 180 

181 GACTTGGAGGTAAAGGGATCCCCTTTGGAACGTGTGTGTGTGTCGGCCGGCGTCTCTGTT 24 0 

241 CTGAGTGTCTGTTTTCGGTGATGCGCGCTTTCGGTTTGCAGCTGTCCTCTCAGACCGTAA 300 

301 GGACTGGAGGACTGTGATCAGCAGACGTGCTAGGAGGATCACAGGCTGCCACCCTGGGGG 360 

361 ACGCCCCGGGAGGTGGGGAGAGCCAGGGACGCCTGGTGGTCTCCTACTGTCGGTCAGAGG 4 2 0 

421 ACCGAGTTCTGTTGTTGAAGCGAAAGCTTCCCCCTCCGCGGCCGTCCGACTCTTTTGCCT 4 8 0 

4 81 GCTTGTGGAAGACGCGGACGGGTCGCGTGTGTCTGGATCTGTTGGTTTCTGTCTCGTGTG 54 0 

541 TCTTTGTCTTGTGCGTCCTTGTCTACAGTTTTAATATGGGACAGACAGTGACTACCCCCC 600 

601 TTAGTTTGACTCTCGACCATTGGACTGAAGTTAGATCCAGGGCTCATAATTTGTCAGTTC 660 

661 AGGTTAAGAAGGGACCTTGGCAGACTTTCTGTGCCTCTGAATGGCCAACATTCGATGTTG 720 

721 GATGGCCATCAGAGGGGACCTTTAATTCTGAAATTATCCTGGCTGTTAAGGCAATCATT7 7 8 0 

781 TTCAGACTGGACCCGGCTCTCATCCTGATCAGGAGCCCTATATCCTTACGTGGCAAGATT 8 4 0 

841 TGGCAGAAGATCCTCCGCCATGGGTTAAACCATGGCTAAATAAACCAAGAAAGCCAGGTC 900 

901 CCCGAATCCTGGCTC7TGGAGAGAAAAACAAACACTCGGCCGAAAAAGTCGAGCCCTCTT 960 

961 CCTCGTATCTACCCCGAGATCGAGGAGCCGCCGACTTGGCCGGAACCCCAACCTGTTCCC 1020 

1021 CCACCCCCTTATCCAGCACAGGGTGCTGTGAGGGGACCTCTGCCCCTCCTGGAGCTCCGG 1080 

1081 TGGTGGAGGGACCTGCTGCCGGGACTCGGAGCCGGAGAGGCGCCACCCCGGAGCGGACAG 114 0 

1141 ACGAGATCGCGATATTACCGCTGCGCACCTATGGCCCTCCCATGCCAGGGGGCCAATTGC 1200 

1201 AGCCCCTCCAGTATTGGCCCTTTTCTTCTGCAGATCTCTATAATTGGAAAACTAACCATC 12 60 
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1261 


CCCCTTTCTCGGAGGATCCCCAACGCCTCACGGGGTTGGTGGAGTCCCTTATGTTCTCTC 


1320 


1323 


• 

. ACCAGCCTACTTGGGATGATTGTCAACAGCTGCTGCAGACACTCTTCACAACCGAGGAGC 


1360 


1381 


• 

GAGAGAGAATTCTGTTAGAGGCTAGAAAAAATGTTCCTGGGGCCGACGGGCGACCCACGC 


1440 


1443 


AGTTGCAAAATGAGATTGACATGGGATTTCCCTTGACTCGCCCCGGTTGGGACTACAACA 


1500 


1503 


CGGCTGAAGGTAGGGAGAGCTTGAAAATCTATCGCCAGGCTCTGGTGGCGGGTCTCCGGG 


1560 


1563 


GCGCCTCAAGACGGCCCACTAATTT GGCTAAGGTAAGAGAGGTGATGCAGGGACCGAACG 


1620 


1623 


AACCTCCCTCGGTATTTCTTGAGAGGCTCATGGAAGCCTTCAGGCGGTTCACCCCTTTTG 


1680 


1663 


ATCCTACCTCAGAGGCCCAGAAAGCCTCAG7GGCCCTGGCCTTCATTGGGCAGTCGGCTC 


1740 


1743 


TGGATATCAGGAAGAAACTTCAGAGACTGGAAGGGTTACAGGAGGCTGAGTTACGTGATC 


1800 


1803 


* • • ♦ . 
TAGTGAGAGAGGCAGAGAAGGTGTATTACAGAAGGGAGACAGAAGAGGAGAAGGAACAGA 


1860 


1863 


GAAAAGAAAAGGAGAGAGAAGAAAGGGAGGAAAGACGTGATAGACGGCAAGAGAAGAATT 


1920 


1923 


TGACTAAGATCTTGGCCGCAGTGGTTGAAGGGAAGAGCAGCAGGGAGAGAGAGAGAGATT 


1980 


1983 


• ' • • . 
TTAGGAAAATTAGGTCAGGCCCTAGACAGTCAGGGAACCTGGGCAATAGGACCCCACTCG 


2040 


2043 


ACAAGGACCAGTGTGCGTATTGTAAAGAAAAAGGACACTGGGCAAGGAACTGCCCCA.aLGA 


2100 


2103 


AGGGAAACAAAGGACCGAAGTCCTAGCTCTAGAAGAAGATAAAGATTAGGGGAGACGGGT 


2160 


2161 


TCGGACCCCCTCCCCGAGCCCAGGGTAACTTTGAAGGTGGAGGGGCAACCAGTTGAGTTC 


2220 


2221 


CTGGTTGATACCGGAGCGGAGCATTCAGTGCTGCTACAACCATTAGGAAAACTAAAAGAA 


2280 


2281 


AAAAAATCCTGGGTGATGGGTGCCACAGGGCAACGGCAGTATCCATGGACTACCCGAAGA 


2340 


2341 


• • • • . 
ACCGTTGACTTGGGAGTGGGACGGGTAACCCACTCGTTTCTGGTCATCCCTGAGTGCCCA 




2401 


GTACCCCTTCTAGGTAGAGACTTACTGACCAAGATGGGAGCTCAAATTTCTTTTGAACAA 


2460 


2461 


GGAAGACCAGAAGTGTCTGTGAATAACAAACCCATCACTGTGTTGACCCTCCAATTAGAT 


2520 
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2521 GATGAATATCGACTATATTCTCCCCAAGTAAAGCCTGATCAAGATATACAGTCCTGGTTG 2580 

• • • ■ 

2581 GAGCAGTTTCCCCAAGCCTGGGCAGAAACCGCAGGGATGGGTTTGGCAAAGCAAGTTCCC 2 64 0 

2641 CCACAGGTTATTCAACTGAAGGCCAGTGCTACACCAGTATCAGTCAGACAGT^^ 2700 

2701 AGTAGAGAGGCTCGAGAAGGAATTTGGCCGCATGTTCAAAGATTAATCCAACAGGGCATC 27 60 

27 61 CTAGTTCCTGTCCAATCCCCTTGGAATACTCCCCTGCTACCGGTTAGGAAGCCTGGGACC 282 0 
2821 AATGATTATCGACCAGTACAGGACTTGAGAGAGGTCAATAAAAGGGTGCAGGACATACAC 28 80 

28 81 CCAACGGTCCCGAACCCTTATAACCTCTTGAGCGCCCTCCCGCCTGAACGGAACTGGTAC 2 94 0 
2941 ACAGTATTGGACTTAAAAGATGCCTTCTTCTGCCTGAGATTACACCCCACTAGCCAACCA 3 000 
3001 CTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACCGGGCAGCTCACCTGGACC 3 060 
3061 CGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGACGAAGCCCTACACAGGGAC 3120 
3121 CTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCTCCAGTACGTGGATGACCTG 3180 
3191 CTTCTGGCGGGAGCCACCAAACAGGACTGCTTAGAAGGTACGAAGGCACTACTGCTGGAA 324 0 
324: TTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGATTTGCAGGAGAGAGG7A 33 00 
3301 ACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGGAGCCACGGAAGA^A 33 60 
3361 ACTGTAG7CCAGATACCGGCCCCAACCACAGCCAAACAAGTGAGAGAGTTTTTGGGGACA 3 420 
3 4 2 1 _ GCTGGATTTTGCAGACTGTGGATCCCGGGGTTTGCGACCTTAGCAGCCCCACTCTACCCG 34 8 0 
3 4 S 1 CTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCTGAGCACCAGAAGGCATTTGATGCT 354 0 

354 1 ATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCCCTCCCTGACGTAACTAAACCCTTT 3600 

• • • • • " 

3601 ACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGTTTTAACCCAAACCCTAGGA 3660 

366 1 CCATGGAGGAGACCTGTTGCCTACCTGTCAAAGAAGCTTGATCCTGTAGCCAGTGGTTGG 3720 

372 1 CCCGTA7GTCTGAAGGCTATCGCAGCTGTGGCCATACTGGTCAAGGACGCTGACAAATTG 37 30 
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37 81 ACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCATTGGAGAACATCGTTCGGCAG 38 4 0 

3841 CCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAAGCCTGCTTCTCACA 3900 

3901 GAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTCTTCTGCCTGAAGAG 3560 

3961 ACTGATGAACCAGTGACTCATGATTGCCATCAACTATTGATTGAGGAGACTGGGGTCCGC 4 02 0 

4021 AAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGTTCACTGACGGAAGC 408 0 

4081 AGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGGACGGGACCCGCACG 414 0 

4141 ATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGCTGAGCTCATGGCCCTC 4200 

4201 ACGCAAGCTTTGCGGCTGGCCGAAGGGAAATCCATAAACATTTATACGGACAGCAGGTAT 4260 

4261 GCCTTTGCGACTGCACACGTACACGGGGCCATCTATAAACAAAGGGGGTTGCTTACCTCA 432 0 

4321 GCAGGGAGGGAAATAAAGAACAAAGAGGAAATTCTAAGCCTATTAGAAGCCTTACATTTG 4 38 0 

4381 CCAAAAAGGCTAGCTATTATACACTGTCCTGGACATCAGAAAGCCAAAGATCTCATATCT 4 4 4 0 

4 441 AGAGGGAACCAGATGGCTGACCGGGTTGCCAAGCAGGCAGCCCAGGCTGTTAACCTTCTG 4 500 

4 501 CCTATAATAGAAACGCCCAAAGCCCCAGAACCCAGACGACAGTACACCCTAGAAGACTGG 4 560 

4561 CAAGAGATAAAAAAGATAGACCAGTTCTCTGAGACTCCGGAGGGGACCTGCTATACCTCA 4 62 0 

4621 TATGGGAAGGAAATCCTGCCCCACAAAGAAGGG7TAGAATATGTCCAACAGATACATCGT 4 68 0 

4 681 CTAACCCACCTAGGAACTAAACACCTGCAGCAGTTGGTCAGAACATCCCCTTATCATGTT 4 74 0 

4741 CTGAGGCTACCAGGAGTGGCTGACTCGGTGGTCAAACATTGTGTGCCCTGCCAGCTGGTT 4 800 

4 801 AATGCTAATCCTTCCAGAATACCTCCAGGAAAGAGACTAAGGGG.aiAGCCACCCAGGCGCT 4 8 60 

4 861 CACTGGGAAGTGGACTTCACTGAGGTAAAGCCGGCTAAATACGGAAACAAATA7CTATTG 4 92 0 

4 921 GTTTTTGTAGACACCTTTTCAGGATGGGTAGAGGCTTATCCTACTAAGAAAGAGACTTCA 4 980 

4 981 ACCGTGGTGGCTAAGAAAATACTGGAGGAAATTTTTCCAAGATTTGGAATACCTA2VGGTA 504 0 

SUBSTITUTE SHEET (RULE 26) 
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5100 



5041 ATAGGGTCAGACAATGGTCCAGCTTTCGTTGCCCAGGTAAGTCAGGGACTGGCCAAGATA 
5101 TTGG6GATTGATTGGAAACTGCATTGTGCATACAGACCCCy^GCTCAG(b^CAGGTAGAG 5160 
5161 AGGATGAATAGAACCATTAAAGAGACCCTTACCAAATTGACCACAGAGACTGGCATTAAT 5220 
5221 GATTGGATGGCTCTCCTGCCCTTTGTGCTiTTTAGGGTGAGGAACACCCCTGGA<:y>.GTTT 528 0 
5281 GCSGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCGTTGGCAGAAATTGCCTTT 534 0 
5341 GCACATAGTGCTGATGTGCTGCTTTCCCAGCCTTTGTTCTCTAGGCTCAAGGCGCTCGAG 54 00 

5401 TGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAGGAGGAGACTTGCAA 54 60 

54 61 GTTCCACATCGCTTCCAAGT7GGAGATTCAGTCTATGTTAGACGCCACCGTGCAGGAAAC 552 0 

5521 CTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACACCAACGGCTGTGAAA 558 0 

5581 GTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTtAaGCYGGCGCCACCTCCCGAC 5 64 0 

5641 TCGGGGTGGAGAGCCGAAAAGAcTGAGAATCCCCTTAAGCTTCGCCTCcAtCGCCTGGTT 5700 

57 01 CCTTACTCTAACAATAACTCCCCAGGCCAGTAGTAAACGCCTTATAGACAGCTCGAACCC £760 

5761 CCATAGACCTTTATCCCTTACCT6GCTGATTATTGACCCTGATACGGGTGTCACTGTAAA 5 8 20 

5321 TAGCACTCGAGGTGTTGCTCCTAGAGGCACCTGGTGGCCTGAACTGCATTTCTGCCTCCG 583 0 

5881 ATTGATTAACCCCGCTGTTAARAGCACACCTCCCAACCTAGTCCGTAGTTATGGGTTCTA £94 0 

5941 TTGCTGCCC^GGCACAGAgAaAGAGAAATACTGTGGGGGTTCTGGGGAATCCTTCTGTAG 6000 

6001 GAGATGGAGCTGCGTCACCTCCAACGATGGAGACTGGAAATGGCCGATCTCTCTCCA6GA 6060 

6061 CCGGGTAAAATTCTCCTTTGTCAATTCCGGCCCGGGCAAGTACAAAATGATGAAACTATA 5120 

6121 TAAAGATAAGAGCT5CTCCCCATCAGACTTAGATTATCT.AA.AGA7AAGTTTCAC7GAAAG 6180 

6181 GAAAACAGGAAAATATTCA^GTGGATA^TGGTATGAGCTGGGGAATAGTTTTTTATT 62 4 0 

62 4 1 ATATGGCGGGGGAGCAGGGTCCACTTTAACCATTCGCCTTAGGATAGAGACGGGGACAGA 6300 
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6301 ACCCCCTGT^CAATGGGACCCGATAAAGTACTGGCTGAACAGGGGCCCCCGGCCCTGGA 
6361 GCCACCGCATAACTTGCCGCTGCCCCAATTAACCTCGCT6CGGCCT6ACATAACACAGCC 6420 
6421 GCCTAGCAACAGTACCACTGGATTGATTCCTACCAACACGCCTAGAAACTCCCCAGGTGT 6480 
6481 TCCTGTTAA^CAGGACAgAgACTCTTCAGTCTCATCCAGGGAGCTTTCCAAGCCATCAA 6540 
6541 CTCCACCGAacCTGATGCcAcTTCnrCTTGTTGGCTTTGTCTATCCTCAGGGCCTCCTTA 6600 
6601 "ATGAGGG^TGGCTAAAGAAAGAAAAl4cAATGTGACCAAAGAGCATAGAAATCAATG 6660 
6661 TACATGGGGGTCCCGAAA7AAGCTTACCCTCACTGAAGTTTCCGGGAAGGGGACATGCAT 6720 
6721 AGGAAAAGCTCCCCCATCCCACCAACACCTTTGCTATAGTACTGTGGTTTATGAGCAGGC 6780 
6781 -TCAGAAAATCAGTATTTAGTACCTGGTTATAACAGGTGGTGGGCATGcAaTACTGGGTT 6840 
684 1 AACCCCCTGTGTTTCCACCTCAGTCTTCAACCAATCCAAAGATTTCTGTGTCATGGTCCA 6900 
6901 AATCGTCCCCCGAGTGTACTACCATCCTGAGGAAGTGGTCCTTGATGAATATGACTATCG 6960 

6961 GTATAACCGACCAAAAAGAGAACCCGTATCCCTTACCCTAGCTGTAATGC7CGGATTAGG 7020 

7021 SACGGCCGTTGGCGTAGGAACAGGGACAGCTGCCCTGATCACAGGACCACAGCAGCTAGA 7080 

7061 GAAAGGACTTGGTGAGCTACATGCGGCCATGACAGAAGATCrCCGAGCCTTAAAGGAGTC 7140 

7141 • STTAGCAACCTAGAAGAGTCCCTGACTTCTTTG7CTGAAGTGGTTCTACAGAACCGGAG 7200 

720: <^«ATTAGATCTGCTGTTTCTAAGAGAAGGTGGGTTATGTGCAGCCTTAAAAGAAGAATG 7260 

•>2e: --GCTTCTATG7AGATCACTCAGGAGCCATCAGAGACTCCATGAACAAGCTTAGAAAAAA 7320 

7321 =T7AGAGAGGCG7CGAAGGGAAAGAGAGGC7GACCAGGGG7GG777GAAGGA7GG77CAA 738 0 

7381 =AGG7C7CC77GGA7GACCACCCTGC777C7GC7CTGACGGGGCCCCTAGTAGTCC7GCT 7440 

7441 ="ST7AC77ACAG77GGGCC77GCT7AA77AATAGGTT7GTTGCCTTTG77AGAGAACG 7500 

7501 «=T=AGTGCAGTCCAGA7CATGGTACTTAGGCAACAG7ACCAAGGCCT7CTGAGCCAAGG 7560 
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7561 


AGAAACTGACCTCTAGCCTTCCCAGTTCT^VGATTAGAACTATTAACAAGACAAGAAGTG 


7620 


7621 


• . , 

GGGAATGAAAGGATGAAAATGCAACCTAACCCTCCCAGAACCCAGGAAGTTAATAAAAAG 


7680 


7681 


CTCTAAATGCCCCCGAATTMCAGACCCTGCTGGCTGCCAGTAAATAGGTAGAAGGTCAcA 


7740 


7741 


CTTCCTATTGTTCCAGGGCCTGCTATCCTGGCCTAAGTA^GATAACAGG^^ATGAGTTGA 


7800 


7801 


ctaatcgcttatctggattctgtaaaactgactggcaccAtagaagaattgattacacat 


7860 


7861 


TGACAGCCCTAGTGACCTATCTCAACTGC^^TCTGTCACTCTGCCCAGGAGCCCACGCAG 


7920 


7921 


ATGCGGACCTCCGGAGCTATTTTAAAATGATTGGTCCACGGAGC3CGGGCTCTC3ATATT 


7930 


79S1 


TTAAAATGATTGGTCCATGGAGCGCGGGCTCTCGATATTTTAAAATGATTGGTTTGTGAC 


8040 


8041 


* 4 ^'^'^'^'^^'-^^AiAAAAGCiGTCCCGATTCCGCACTCGGGGCCGCAGT 


8100 


8101 


CCrCTACCCCTGCGTGGTGTACGACTGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCT 


3160 


8161 


TGCTGTTTGCATCAAAAAAAAAAAAAAAA^WVAAA S i 9 6 
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so 

120 



1 ^STGGTGTACGACTGTGGGCCCCAGCGCGCTTGGAATAAA^ATCCTCTTGCTGTTTGCATC 

61 AAGACCGCTTCTCGT6AGTGATTTGGGGTGTCGCCTCTTCCGAGCCCGGACGAGGGGGAT 

121 TGTTCTTTTACTGGCCTTTCATTTGGTGCGTTGGCCGGGAAATCCTGCGACCACCCCTTA 180 

1 8 1 ^CCCGAGAACCGACTTGGAGGTAAAGGGATCCCCTTTGGAACGTGTGTGTGTGTCGGCC 2 4 0 

241 «<5CGTCTCTGTTCTGAGT6^CTGTTTTCG^GATGCGCGCTTTCGGTTTGCAGCTGTCCr 300 

301 ^TCAGACCGTAAGGACTGGAGGACTGTGATCAGCAGACGTGCTAGGAGGATCACAGGCTG 360 

361 CCACCCTGGGGGACGCCCCGGGAGGTGGGGAGAGCCAGGaACGCCTGGTGGTCTCCTACT 420 

4 2 1 ^ ' CGGTCAG^GGACCGAGTTCTGTTGTTG^GCGAAAGCTTCCCCCTCCGCGGCCGTCCG 4 8 0 

481 ACTCTTTTGCCTGCTTGTGGAAGACGCGGACGGGTCGCGTGTGTCTGGATCTGTTGGTTT 540 

541 "<^TCTCGTGTGTCTTTGTCTTGTGCGTCCTTGTCTACAGTTTTAATATGGGACAGACAG 600 

MetGlyGlnThrV 
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1081 CTGGAGCTCCGGTGGTGGAGGGACCTGCTGCCGGGACTCGGAGCCGGAGAGGCGCCACCC 114 0 
roGlyAlaProValValGluGlyProAlaAlaGiyThrArgSetArgArgGlyAlaThrP 

1141 CGGAGCGGACAGACGAGATCGCGATATTACCGCTGCGCACCTATGGCCCTCCCATGCCAG 12 00 
roGluArgThrAspGluIleAlalleLeuProLeuArgThrTyrGlyProProMetProG 

1201 GGGGCCAATTGCAGCCCCTCCAGTATTGGCCCTTTTCrrCTGCA^^ 1260 
lyGlyGlnLeuGlnProLeuGlnTyrTrpProPheSerSerAlaAspLeuTyrAsnTrpL 

1261 AAACTAACCATCCCCCTTTCTCGGAGGATCCCCAACGCCTCACGGGGTTGGTGGAGTC 1320 
ysThrAsnHisProProPheSerGluAspProGinArgLeuThrGlyLeuValGlSserL 

1-321 TTATGTTCTCTCACCAGCCTACTTGGGATGATTGTCA«^CAGCTGCTGCAGACACTC^-CA . 3on 
euMetPheSerHisGlnProThrTrpAspAspCysGlnGlnLeuLeuGlnVh^nu^^e? 

1381 CAACCGAGGAGCGAGAGAgAaTTCTGTTAGAGGCTAGAAAAAATGTTCCTGGGGCCGACG 144 0 
hrThrGluGluArgGluArglleLeuLeuGluAlaArgLysAsnValProGlyAlaXspG 

lyArgProThrGlnLeuGlnAsnGluIleAspMetGlyPheProLeuThrArgProGlyT 

1501 GGGACTACAACACGGCTGAAGGTAGGGAG^GCTTGAAAATCTATCGCCAGGCTCTGGTGG 156 0 
rpAspTyrAsnThrAlaGluGlyArgGluSerLeuLysIleTyrArgGlnAlaLeuValA 

ff^ff P^f CTcAagACGGCCCACTAATTTGGCTAAGGTAAGAGAGGTGATGC 162 0 

laGlyLeuArgGlyAlaSerArgArgProThrAsnLeuAlaLysValArgGluValMecG 

1 62 1 AGGGACCGAACGAACCTCCCTCGGTATTTCTTGAGAGGCTCATGGAAGCCTTCAGGCGGT 1 68 0 
InGlyProAsnGluProProSerValPheLeuGluArgLeuMecGluAlaPheArgArgP 

1681 TCACCCCTTTTGATCCTACCTCAGAGGCCCAGAAAGCCTCAGTGGCCCTGGCCTTCAT-G 174 0 
heThrProPheAspProThrSerGluAiaGlnLysAlaSarValAlaLeuAlaP;en;G 

lyGlnSerAlaLeuAspIleArgLysLysLeuGlnArgLeuGluGlyLeuGlnGluAlaG 

luLeuArgAspLeuValArgGluAlaCluLysValTyrTyrArgArgGluThrGluGluG 

J°j;^^tf?t''^''^''^''''^'''^^''=^^^^='='^^<=^^CGTGATAGACGGC 1920 
luLysGluGloArgLysGluLysGluArgGluGluArgGluGiuArgArgAspArgAroG 

^^^^ tJ^tI:f''^^^r'^^?^^'^''*^^^°°^^^<=^*^^*^^TTGAAGGGAAGAGCAGCAGGGAGA 1^80 
InGluLysAsnLeuThrLysIleLeuAlaAlaValValGluGlyLysS^rSerArgCluA * 

1981 GAGAGAGAGATTTTAGGAAAATTAGGTCAGGCCCTAGACAGTCAGGGAACCTGGGCAATA 2 04 0 
rgGluArgAspPheArgLysIleArgSerGlyProArgGlnSerGlyAsnLeuGlyAsAA 
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r^ilp'^'d^'T^ 2100 
rgThrProLeuAspLysAspGlnCysAlaTyrCysLysGluLysGlyHisTrpAlaAr^A 

2281 GA^^CTAAAAGAAAAAAAATCCTGGGTGATGGGTGCCACAGGGCAACGGCAG-'ATCr^- 
lyLysLeuLysGluLy,tysSe.TrpValM«GlyAlaThrGlyGTiJ?^G?n;JIpS; 

2761 ycCAACASOGCATCCTAGTTCCTGTCCAATCCCCTTOGAA-HCTCcrTr-T^— ™; 
luAr,A.nTrpTy.ThrVal.euAsp..XejJ:jSJS;^JSJ^^^S?JI;JS^ 
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3001 CCACTAGCCAACCACTTTTTGCCTTCGAATGGAGAGA7CCAG 3350 
roThrSerGlnProLeuPheAlaPheGluTrpArgAspProGlyThrGlyArgThrSlyG 

3061 AGCTCACCTGGACCCGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGACSAAG 
InLeuThrTrpThrArgLeuProGlnGlyPheLysAsnSerProThrllePheAapGluA 

3121 CCCTACACAGGGACCTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCTC'-AGT 
laLeuHisArgAspLeuAlaAsnPheArgIleGlnHisProGlnValThrLeuI.euGlnT 

31B1 ACGTGGATGACCTGCTTCTGGCGGGAGCCACCAAACAGGACTGCTTAGAAGGTACGAAGG 32 4 
yrValAspAspLeuLeuLeuAlaGlyAlaXhrLysGlnAspCysLeuGluGlyThrLysA 

32 4 1 CACTACTGCTGGAATTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGAT-T 
laLeuLeuLeuGluLeuSerAspLeuGlyTyrArgAlaSerAlaLysLysAlaGlnliec 

3301 GCAGGAGAGAGGTAACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGG 
ysArgArgGluValThrTyrLeuGlyTyrSerLeuArgGlyGlyGlnArgTrpLeuThrG 

3361 AGGCACGGAAGAAAACTGTAGTCCAGATACCGGCCCCAACCACAGCCAAACAAGTGA 3420 
luAlaArgLysLysThrValValGlnlleProAlaProThrThrAlaLysGlnvIlArgG 

34 21 AGTTTTTGGGGACAGCTGGATTTTGCAGACTGTGGATCCCGGGGTTTGCGACCTTAC- r r,^ 0 « 
luPheLeuGlyThrAlaGlyPheCysArgLeuTrpIleP.oGlySeAJShrniJr^ 

3481 CCCCACTCTACCCGCTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCTGAGCACCAGA 
laProLeuTycProLeuThrLysGluLysGlyGlyPheS^tTrpAlaProGluHisGinL 

354 1 AGGCATTTGATGCTATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCCCTCCCTGACG 
ysAlaPheAspAlalleLysLysAlaLeuLeuSerAlaProAlaLeuAlaLeuPrsAspV 

3601 TAACTAAACCCTTTACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGTTT- 
alThrLysProPheThrLeuTyrValAspGluArgLysGlyValAlaArgGlyValLe 

^ S^^t^^^fT'^Sf ACCATGGAGGAGACCTGTTGCCTACCTGTCAAAGAAGCTTGATCC-3 
HrGlnThrLeuGlyProTrpArgArgProValAlaTyrLeuSerLysLysLeuAspProV 

* Itl? fJJ^^ri^'^f ^^^'t^t^tctgaaggctatcgcagctgtggccatActggtcaagg 

alAlaSerGlyTrpProValCysLeuLysAlalieAiaAiaValAlalleLeuValLysA 

spAlaAspLysLeuThrLeuGlyGlnAsnlleThrVaill^AlaProHisAlaLeuGiuA 

3841 ACATCCTTCGGCAGCCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAA 3900 
snlleValArgGlnProProAspArgTrpMetThrAsnAlaArgMetThrHisTyrGrJJ 

3 901 GCCTGCTTCTCACAGAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACrc 3960 
erLeuLeuLeuThrGluArgValThrPheAlaProProAlaAlaLeuAsnProAlaTh-'L 
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3600 



•AOi 3660 
uT 



3720 



780 
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3961 TTCTGCCTGAAGAGACTGATGAACCAGTGACTCATGATTGCCATCAACTATTGATTGAGG ^ 07n 
euLeuProGluGluThrA«GluProValThrHisA«CysHisGtJSu£lSnG?^^ 

4 021 AGACTTCGGTCCGCAAGGACCTTACAGAcAtacCGCTGACTGGAGAAGTGCTAACCTGGT .nftn 
luThrGlyValArgLysAspLeuThrAspIleProLeuThrGlyGluVaf^eJJSSrpP 

4081 TCACTGACGGAAGCAGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGC^^ . 1 i, n 

heThrAspGlySerSerTyrValValGluGlyLysArgMecAlaGryAlaS^SImJA 

spGxyThrArgThrlleTrpAlaSerSerLeuProGluGlyThrSerAlaGlnLysAlaG 

4201 AGCTC:ATGGCCCTCACGCA;.GCTTTGCGGCTGGCCGAAGGGAAATCCATAAACAttTAT; ,Of.n 
luLeuMetAlaLeuThxGinAlaLeuArgLeuAlaGluGlyLysSerJlt^SieJyS 

lnGlnneHisArgLeuThrHisLeuGlJ?S;^S?^^;.=SnSnnuvL"^^^^ 

4741 fCCCTTATCATGTTCTGAGGCTACCAGGAGTGGCTGACTCGGTGGTCAAACAT-GTGTrr ^ «nn 
erProTyrHisValLeuArgLeuProGlyValAlaAspServIwIJJJJSIcylSIJp 

4801 CCTGCCAGCTGGTTAATGCTAATCCTTCCAGAATACCTCCAGGAAAGAGACT- —rrr;.;; .o^n 
roCysGlnteuValAsnAlaAsnProSerArglleProProGlJi^S^S;:::^^^^^ 

f^5?;^^^°*=ff^^^C"==S^<=TGGACTTCACTGAGGTAAAGCCGGCTAAA7ACGGAA 4920 
erHisProGlyAlaHisTrpGluValA^pPheThrGiuValLysProAlaLysTyrG^JJ 
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4921 ACAAATATCTATTGGTTTTTGTAGACACCTTTTCAGGATGGGTAGAGGCTTATCCTACTA 4980 
snLysTyrLeuLeuValPheValAspThrPheSerGlyTrpValGluAlaTy.-ProThrl 

4981 AGAAAGAGACTTCAACCGTGGTGGCTAAGAAAATACTGGAGGAAATTTTTCCAAGATT'S ^04 n 
ysLysGluThrSerThrValValAlaLysLysneLeuGluGluIle;A;PrJJrgS;G 

S041 GAATACCTAAGGTAATAGGGTCAGACAATGGTCCAGCTTTC6TTGCCCAGGTAAGTCAGG SlOO 
lylleProLysVallleGlySerAspAsnGlyProAlaPheValMaSnValSelSnG 

5101 GACTGGCCAAGATATTGGGGATTGATTGgWtGCATTGTGCATACAGACCCC^^ 5160 
lyLeuAlaLysIleLeuGlylleAspTrpLysLeuHisCysAlaTyrArgProGlnSerS 

5161 CAGGACAGGTAGAGAGGATGAATAG.AACCATTAAAGAGACCCTTACCAAATTGACCACAG 

erGlyGlnValGluArgMetAsnArgThrlieLysGluThrLeuThrLyJIeShr?^^^ "° 

luThrGlylleAsnAspTrpMecAlaLeuLeuProPheValLeuPheArgValArgAsP.T 

5281 CCCCTGGACAGTTTGGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCC-TT-^ ^^^n 
hrProGlyGlnPheGlyLeuThrProTyrGluLeuLeuTyrGlyGlyProPrcPrS^^ ° 

5341 C^GAAATTGCCTTTGCACATAGTGCTGATGTGCTGCTTTCCCAGCCT-^TGTTC-^C-AGrr ^iinn 
laGluIleAlaPheAlaHisSerAlaA3pValLeuLeuSerGlnPr;£luPheie;Jrg^ 

5401 TCAAGGCGCTCGAGTGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAG 54 60 
euLysAlaLeuGluTrpValArgGlnArgAlaTrpLysGinLeuArgCr^lAiSJrl^G 

5461 SAGGAGACTTGCAAGTTCCACATCGCTTCCAAGTTGGAGATTCAGTCTATGTTAGACGCC 5520 
lyGlyAspLeuGlnValProHisArgPheGlnValGlyAspSerValTyrSalA^gArgH 

5521 ACCGTGCAGGAAACCTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACAC 5 5 R O 
isArgAlaGlyAsnLeuGluThrArgTrpLysGlyProTyrLeuValL^^LeShrJhJp 

5581 CAACGGCTGTGAAAGTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCCGr ^^iin 
roThrAlaValLysValGluGlylleProThrrrplleHisAlSrSwIIJJsProI 



5641 CGCCACCTCCCGACTCGGGGTGGAGAGCCGAAAAGACTGAGAATCCCCTT^AGCTTCGCC 5700 
ArgHisLeuPrcThrArgGlyGlyGluPrcLysArgLeuArglieProLeuSertnei?; 

5701 TCCATCGCCTGGTTCCTTACTCTAACA^TAACTCCCCAGGCCAGTAGTAAACGCC— ATA 5760 
euHisArgLeuVaiProTyrSerAsnAsnAsnSerProGlvGinEnd '^'''^'='=-^^^ ^760 

SecIleAlaTrpPheLeuThrLeuThrlleThrProGln.'daSerSerLysArgLeune 

5761 fACAGCTCGAACCCCCATAGACCTTTATCCCTTACCTGGCTGATTATTGACCCTG^TACG 5620 
/vspSerSerAsn?reHi5ArgProLeuSerLeuThrTr?LeuIleIleAs;p?;AShr 
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i?lvltT^rtl^''T''^^^^^^ S880 
GlyValThrValAsnSerThrArgGlyValAlaProArgGlyThrTrpTrpProGiuLeu 

5881 CATTTCTGCCTCCGATTGATTAACCCCGCTGTTAAAAGCACACCTCCCAACCTAGTCCGT SO^n 
HisPheCysLeuArgl^uIl^AsnProAlaValLysSerThrProPrSsnLewISrJ ° 

5941 AGTTATGGGTTCTATTGCTGCCCAGGCAci.GAGAAAGAGAAATACTGTGGGGGTTCTGGG finnn 
SerTyrGlyPheTyrCysCysProGlyThrGluLysGlutysTyrCysGlySylelG^^ 

6001 GAATCCTTCTGTAGGAGATGGAGCTGCGT^CCTCCAACGATGGAGACTGGAAATGGCCG ^;n-n 
GluSerPheCy,ArgArgTrpSerCysValThrSerAsnAspGlySp?rSJ???pp" 

6061 ATCTCTCTCCAGGACCGGGTAAAATTCTCCTTTGTC.^AT7CCGGCCCGGGCAAGTACA^ 61 
IleSerLeuGlnAspArgVallysPheSerPheValAsnSerGlyProGly^sTyriCJ 

S:fMl^f^P''I'''^'^*^*^°^°^^=<=T^'=^'^A"C-GACTTAGATTATC eiBO 
MecMetLysLeuTyrLysAspLysSerCysSerProSerAspLeuAspTyrL^CTyJue 

6181 J^^TTCACTGAAAGGAAAACAGGAAAAT^^^ ^^40 
SerPheThrGluArgLysThrGiyLysTyrSerLysValAspLysTrpTyrGIuLeucIy 

AsRSerPheLeuLeuTyrGlyGlyGlyAiaGlySerThrLeuThrlleArgLeuArglle 

6301 SAGACGGGGACAGAACCCCCTGTGGCAATGGGACCCGATAAAGTACTrrr-'ranr"-,.r.; 

GluThrGlyThrGluProProValAiaMecGlyp"oSspJJ^5I?^:SJ:Si&^^^^ 

6361 CCCCCGGCCCTCGAGCCACCGCATAACTTGCCGGTCCCCCAATTAACCTCGCTGCGTr' 'A-^n 
Pro?roAlaLeuGluProProHisAsnLeuProValFroGlnJS?hrSerLeS«^^^ 

J^^T'^^''^t'^'=^=^'=^^<='=^=A=TACCACTGGATT3ATTCCTACCAACACGCCTAGA 64 8 O 
AsplleThrGlnProProSerAsnSerThrThrGlyLeuIleProThrAsShrPr^Arg 

64 81 AACTCCCCAGGTGTTCCTGTTAAGACAGGACAGAGACTCTTCAGTCTCATCCAGGGArrx n 
AsnSerProGlyValProValLysThrGlyGlnArgLeuPheSer^IuJI^GJnGl^^^ 

6541 ^TCC/^GCCATCAACTCCACCGACCCTGATGCCACTTCTTCT-TGTTGGCTTTG-C-AT-- -nn 
PheGlr^laIleAsnSerThrAspProAspAlaThrSe.SerCysTrpL;:S;^;S:: 

6601 ^CAGGGCCTCCTTATTATGAGGGGATGGCTAAAGAAAGAAAATTCAATGTGACCAAA-i.r -^^n 
SerGlyProProTyrTyrGluGlvMetAlaLysGluArSJ^^JeJJnvI^^^^ 

6661 CATAGAAATCAATGTACATGGGGGTCCCGAAATAAGCTTACCCTCACTGAAGT— CCGGG 'l^n 
riisArgAsnGlnCysThrTrpGlySerArgAsnLysLeuThrLauTh^GtiSIiserG^y 

JJsGiv?:^JysSJGiO?CtJ>''p^''r^^ «^«0 

wy5^,-v i...c/sll.eGiyLysA:aProProSerHisGlnKisLeuCysTyrSerThrVal 
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6781 GTTTATGAGCAGGCCTCAgAaAATCAGTATTTAGTACCTGGTTATA^^^ 68 4 0 

ValTyrGluGlnAlaSerGluAsnGlnTyrLeuValProGlyTyrAsnArgTrpTrpAia 

6841 TGCAATACTGG6TTAACCCCCT6TGTTTCCACCTCAGTCTTCAACCAATCCAAAGATTTC 69 00 
CysAsnThrGlyLeuThrProCysValSerThrSerValPheAsnGlnSerSCJS^^ 

6901 TGTGTcy^TGGTCCAAATCckcCCCGAGTCTACTACCATCCTGAGGAAGTGGTC 6960 
CysValMetValGlnIleValProArgValTyrTyrHisProGluGluv;i5alLl^p 

6961 GAATATGACTATCGGTATAACCGACCAAaAagAGAACCCGTATCCCTTACCCTAGCTGTA 7020 
GluTyrAspTyrArgTyrAsnArgProLysArgGluProValSerLeuT^rLeS^JISIt 

7021 ATGCTCGGATTAGGGACGGCCGTTGGCGTAGGAACAGGGACAGCTGCCC-GATCACAGGA 7 08 0 
MetLeuGlyLeuGlyThrAlaValGlyValGiyThrGlyThrAlaAlaL;uIleS?Gry 

' ' ^5^?tnrt°-''''"^''r^''H'^"=="^^''"'^^^^=^^'='^CATGACAGAA 714 0 

?-0GlnGlni,euGluLysGlyLeuGlyGluLeuHisAlaAlaMetThrGluAspl.euArg 

7141 SCCTTAAAGGAGTCTGTTAGCAACCTAGAAGAGTCCCTGACTTCTTTGTCTGAAGTGGTT 7200 
AlaLeuLysGluSerValSerAsnLeuGluGluSerLeuThrSarLeuSerGluValSai 

LeuGlnAsnArgArgGlyLeuAspLeuLeuPheLeuArgGluGlyGlyLeuCysAiaAla 

7261 7TAAAAGAAGAATGTTGCTTCTATGTAGATCACTCAGGAGCCATCAGAGACTCCATGAAC 7 32 0 
i.euLysGluGluCysCysPheTyrValAspHisSerGlyAlaIieArgAspSeS«JJn 

7 321 '^GCTTAGAAAAAAGTTAGAGAGGCGTCGAAGGGAAAGAC^GGCTGACcAggGGTGGT^ 73 8 0 
LysLeuArgLysLysLeuGluArgArgArgArgGluArgGluAiaAspGlnGlyTrpPh; 

G^^Sf J^^^Ik ^^''t^""''''"'''^^^^^^<=^^^=^=^^TTCTGCTCTGACGGGGCCC 7 4 4 0 
GluGlyTrpPheAsnArgSerProTrpMetThrThrLeuLeuSerAlaLeuThrGlyPro 

LeuValValLeuLeuLeuLeuLeuThrValGlyProCysLeulleAsnArgPheValAla 

75Ci JJTGTTAGAGAAMAGTGAGTGCAGTCCAGA^ 

PheValArgGluArgValSecAlaValGlnlleMecValLeuArgGinGlnTyrGlnGly 

llllluTeTc^l't^r'^^^^^ 7620 
j-eui-eu5erGlnGlyGluThrAspLeuEnd 

621 CAAGACAAGAAGTGGGGAATGAAAGGATGAAAATGCAACCTAACCC7CCCAGAACCCAGG 7680 
681 AAGTTAATAAAAAGCTCTAAATGCCCCCGAATTACAGACCCTGCTGGCTGCCAGTAAATA 774 0 
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" 7800 



7741 GGTAGAAGGTCACACTTCCTATTGTTCCyVGGGCCTGCTATCCTGGCCTAAGTAAGATAAC 



7801 A6GAAATGAGTTGACTAATCGCTTATCTGGATTCTGTAAAACTGACTGGCACCATAGAAG 7860 

7861 AATTGATTACACATT(»CAGCCCTAGTGACCTATCTCAACTGCAATCTGTCACTCTGCCC 7920 

7921 AGGAGCCCACGCAGATGCGGACCTCCGGAGCTATTTTAAAATGATTGGTCCACG6AGCGC 79B0 

7981 GG6CTCTCGATATTTTAAAATGATTGGTCCATGGAGCGCGGGCTCTCGATATTTTAAAAT 804 0 

8041 GATTGGTTTGTGACGCACAGGCTTTGTTGTGAACCCW^TAAAAGCTGTCCCGATTCCGCA 8100 

8101 CTCGGGGCCGCAGTCCTCTACCCCTGCGTGGTGTACGACTGTGGGCCCCAGCGCGCTTGG 8160 

8161 AATAAAAATCCTCTTGCTGTTTGCATCAAAAAAAAAAAAAAAAAAAAAA 82 09 
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Figure 4. 

The same nucleotide sequence as represented bv bases 5260 -o S^^o 

in Figure 3 is also representative for this Figure, v^:V 1^1 

following changes: 

?os it ion Chance 



5273 
5341 
5351 



G-T 
C-T 
C-T 



G-A 



5353 T-C 
5356 
5426 

f"^^^ Inserzicn AGA 

d607 C^T 
5633 

5792 x-C 

Insertion A.^. 
6253 T-^ 
6255 
6900 



j.nsertion A 
C-G 



f-^™^'"''"^--^.^^'^! changes result in the following ar^'no 
cnanges m tne Ei^n/ polypeptide. ^—n^ 

Position Chance 



acid 



7 
192 



R-W 
R-K 



J^Q^ Deletion 

Deletion 



1S7 
193 
199 
200 
201 
204 
205 
206 
206 
203 
209 
211 
212 
427 



S-E 
K-N 
V-I 
D-Q 
Y-I 
E-N 

Insertions : G,M, S 
L-W 

S-V 
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rJLCure d 



SFV-3 
HSRV — 



PoEV 

GALV., ^^EV 
FeLV 
\ Ml'LV 




MMTV MPMV 



RSV 



1C% 



MuLV 

FeLV 

GaLV 

SVV-1 

SFV-3 

HSRV 

BLV 

HTLV 



murine leukaemia virus 
feline leukaemia virus 
S'bbcn ape leukaemia virus 
simian foamy virus ^ 
simian foamy virus 3 
human fcamy virus 
Bovine leukaemia virus 
^ human T-cell leukaemia virus 
MMTV murine mammar/ tumour virus 
MPMV Mason Pfi-er monkey virus 
?»SV Rous sarcoma virus 
HV feline immunodeficiency virus 
HIV human immunodeficiency virus 
ciAV ecuine infecrious anaemia virus 
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PPT I U3 OCT-1 c-MyD LyF-1 E47 

ETS.1 AP^ 

£ 1 CAGGi^J=',GTT. -Ar.^J=J^^^^^ 

MF-1 AP-1/TR 

101 GC 7 GC C AGTJ^J^AT A GGT AGJL^GGT CACA CT T C CTATTGrC CCAGGG I CT 3 

ET3-1/GATA GATA ET3-1 C-Myb A?-1 GATA 

151 CTATCCTGG CCTAAGT AAGATAJ='.CAG-GAAA.TGAGT 

EXT AP-1 

2 CI CTGGATTC TGTi=L"J LACTGACTGGCA CCA.rAGAAGAArTGAr 



E47 ETS.1 CCAA1 

3 •:■ 1 c :ACGCA.GA,TGCGGAGCTCCGG.AGCrA rrTTAA.^AT GA,rTC^ 

GATA CCAAT^ 
351 GCGCGGGC TCTCGATATTTTAJ ^a^T GATTGG TCCArGGAGCGCGGGC TCr 

GATA CCAAT4- AP-1/CRE3 

4 C 1 CGArATTTTAA AAT GATTGG TT TGTG-ACGCACAG GCTTrGrTGT'GAACZC 

TATA U3 I R 

4 5 : CArAAAAGCTGTCC CGATTCCGCACTCGGGGCCGCAGTCCTCTACr- 

PADS polyA 
3 C 1 CGTGGTGTACGACTGTGGGCCCCAGCGCGCT' 

R i US 

zzi CTGTTTG CAT C^S^GJ^jZCGG T T C T G T GAG 1 GAT T T GGGG 7 G T CGC C T CT T 

US 1 PSS 
;GATTGTTGTTTTAGT3GCCTTTCATT TGGTG7 



JwL GTTGGCCGGGAAArCCTGCGAGC 
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